CA 02283458 1999-09-28 



M-l 
-H 

O 

0) 

Pi 

CO 

A) 

4J 
O 

m 

fO 

tQ 
3 
U 
O 
0 

o 
o 
-u 

Qi 
<U 

CO 



co 

0) 

o 
a 

& 

a) 
to 



o 5 

o p 
u 

0) CO 

l-f n 



<D 



a 

rl 
U 
Pi 



U 
O 



o 

•H 
4J 

cd 

4J -rl 
U Pi 
CO 9 



H 
H 
H 
> 

0) 



o 

Q OOOHHfMHCMCM 
M CMCN4CVJ(MCMCMCM<NCM 

O 
W 
CO 



r- oo 

CM CNJ 
CM CM 



1 ss? 



1 £ 



QJ 
U 
03 

cn 

<H 
<C 
4-» 

•r-» 

a 

03 



cm CD CD CD 
H H H 

O CD CD 

a o 

H Eh 
H Eh 

o o 

H H 

Eh Eh 

U U 

EH H 



O O IS 15 

u u u o 

H H H H 
UOUU 

H H EH H 
H Eh Eh H 



a cd 



u o o 

u o o 

O Eh 

Eh Eh 

O U 

EH H 
EH 



a o cd cd cd a cd o 

5» H EH O O 0> O * 

... Id 

Eh EH 



EH EH U . , 
H EH H Eh EH H 



a 

H 
En 

a 

EH 

u 

Eh 
Eh 



VSiiff 9BB888S8S1II8 BIB 

MMimmifMH 

' r'r'^Lluti^PHhh tn tT> Cn O fa CP CT> tX» 



z o 

CD 




EhEhEhEhEhEhEhEhEh 
H Eh Eh OHFjtj * Eh 




Eh H Eh Eh Eh Eh Eh 

U H EH EH EH EH EH 

id id Eh rt H 

O O 5 U U O U 

uou u u u u 




ID U O © O 15 © 



SEBeGSgSSSSS 




- o O U rt u u 



in *C < < < < < 
o 3 a o u u q 
ro o u u o o a 



OOCJOOOOUOUUU uo 



4J 

o 

CD 
■ — 1 

o 

2 



<U 0) 



a) a> 
ra ro 



ro ra 

8.8. 



O O 

ra fQ 

tp 

ra fa 



•H 
•H 

q to 

& q 
TJ fa 

^ O O 3 
ra -Q Cn 



CO 



co 
U 

c 



co 

H V) 

c: 3 

3 -H 

ra o 
cq to 



co co 

W 3 
^H CO 



•H 4J CO 



o 
x: 

^ o 

-M o 



ra «o 

ra 3 

> ra 



s H U 

o o 

5 U CD 

u u - 

o o u 
m 

CO < 

O C co 

m ra qj 

co co -H 

T3 4-> 

c = o 

ro co d) 
jj = 

co o 
- P 
to s H C 
CD OC 
CO s 

m 

u 



O 



Q) 
CO 

T3 

<L) 
+J 
U 
<U 

rH 

<u 
cn 



o 

U V 
O XI 



a n 

H >i Q) CO M-l 

-Q C O 

U -O co m 

C -H <U = O 

CD XI M & -P 

- C rd - 




M 

O 03 

CD U W U 

^ ^ = c 

C *J S M CO 0) 
CD O u_i d ^ 

§ -i >• i2 

&03 = xj 
u c: 

<D *h s nl : -n 

10 4J [H JJ H £ 

c g= w = 

H -fH n & 



m a) 

4-> CO 



CO 



o 



o 



119 



CA 02283458 1999-09-28 



00 
CD 
U 

a 

s 

a) 

4J 

§ 

0 m 

U w 

o o 

Pi 

-u O 
CO ^ 

. *J 
^ rd 

O N 

c -o 

O -H 
•H H 

0) ^ 



Q vo o> 

H C O 

CM CM 

a 
w 



ID 

CM 
CM 



• eg <vj cs» ro ro 
oj cm c\j cm cm 



o < < 

r* H e-. 



iJ 

•H 

a 

O 



to 
a) 

•D 



O 
CD 




3 
2 



BSSSssiiSSiiiBf 3£ 



« 1 1 1 1 1 

iliiiiffff * 



5lI|IIIIlilIl||I||g 
sHHIHHlllHlhf 



0 0 8 8 88 



o o p o 

' H H 



iH 
<D 
CO 

-a 
■u 

M 
O 



O 
-H 
*W 
-H 
O 

a> 

Pi 
OQ 
I 

0) 
D> 4J 

a> u 
cd i-s 

CO (Q 




opooooooo 
ts o o o cs 



poo 

HUH 



I1IIHI1I111! 



CD 

1 

no h 
oo e-4 
to o 

o tJ 
s e- 

o « 
o 3 

W CD 

Eh 
t" 
Eh 

CJ 

CD 
Eh 
U 



SB 
S3 

H H h 

o cd cd 

v CD O 



ii 



if f If til 

22««o«joo 
d d * 3 o « u o 




HHHI8HHIHI 



£ ft E £ « 

- y o +j 

Hill 
5 d o o 3 
d d o o 5 

H H S S g 

{HIV 

O CD CD CD CD 
CD CD CD CD CD 




Oi 
W 
CO 



a 

CD 
CJ 

5 

< 
CD 

Eh 
Eh 

CD 
CJ 

El 

CD 
CJ 
EH 
Fh 
CJ 

*c 

s 

Eh 



■■ <U 

2 s 

2 O 

rH 

Q 

W >, 

a* 

Cd T3 
CO CO 
*— 4-> 

m 

4J o 

C -H 
<y T> 

& 

CD 

c co . 
•"Eg. 

•■3 8 

^ 0) 5 
tj CJ 

£ o S 
u cr 2 
to to 10 

O id £ 

^ IH 



CD CD CD CD CD CD ' w 
CD CD CD CD CD CD CD 



s S 

6 e 

8 f 

g i 
2 S 



0) 



£0 

a 

e <D CD 
•Hum 

'H iJ 4J 
5 O O 
O <B (D 
-H ^ 

O c^ 
Hum 



CD CD 

•H -H 
4J +J 
CJ O 

«c m 

as 



CD 

0) tt) 

a a 

■u to io 

o o o 

»o q a 
"4 *H 

bi q q 

ni m qj 



01 

a to 

to « 

co c; cd 

-rs cj io 

S> Oi *H *H 

P q H »h 



cj a 



4 u o o *o -c? CD *S 



. CD c 

c cj»-h a tj 

i to a q 

-O >, Cr qj o 



CD ^ 
° £ 



"H CD 

a ra 

C3l -»h) Co 



0) 

a 



to to 



O In 4J 

2 o a h 



, CD 

o c 

e cd-m 

a tri 4j 

CD O -u 

5 ^ <Q IQ 



M c: a 
^ a *m 



cocococococo" coco 



t^acx^tocotQtoa 



1 -Q 'H O 



<o' « oj bJ „J ^ „• oj oj bj" «o eo to co oj *» «q" ej oj co' 



to to 

CD CD 

, u o 
3 c c 

•N CD CD 
M 3 3 

nj cr cr 

•*hj a) <d 
a to « 

*1 T3 "D 

CO -U 4-» 
CD O U 
> CD CD 

■ tu CD 
CO CO CO 



cS W > 

>-l CD 

• S «j 



•H 
T3 



g CD 

a 

C 4J 



CD 

a 

to 

•o o 

CD ~H 



rQ CO 

>i CD 



u o 

CD -H 

a +j 

to rtJ 



CD O 

u u 

CD rH 

tr u H 

" £ .c 

CD QJ H 

^ -a 



O 
CN 



CN 



O 

m 



CO 



120 



CA 02283458 1999-09-28 



CO 
GO 



o en H CM 

cd OD CO CO 

n ro c> fn ci 



S " m a. m i £ £ £ 

(N N N N tN m 



m 

<D 

o 
a 



a> 

CO 

s 

§ 

u 



O 



CD 



a? d 

is o 

ca £ 

4-1 as 
o o 

-H 

d *m 

O -rl 

■H H 
*-* Pi 

rH 

a> a 

CO -rl 

A u 

. * 
u w 

O i 
M-J Q) 

id 

4J ID 

m m 
m m 
■p b> 




ra 3 
cn < *$ s h 



b e u r 

H H H H 

o 5 o o 



isssssisas 



t) 2 O 

o 3 q 

ill 



5 o u o o o 



rt H 



SBBSSSBBBBS 



BP MHJH 
S -8 u 3 y B 



B B 55 S t> B 



S 8 8 | B 

liSISSlHllSSItsl 



oo e h 



b b e b b a g 



fM H H 



u u y O OH H 
u u o o u o u 




P u 



s 




ISBoooouBoguo 
+j u q 5 <J u 

tiflHIli 



O 

o 

CM V3 

6-" 
O O 

a % 

Q 6- 

M E-« 
O E- 

s 



8 



0888,888,8,8,888 

Ililm 11 " 



O 5 b O 



^ O H H «j c- — 

bouuuooouu 



ig§g3l 

111! 



h 5 g £ 



rt H rt H 




. . O H H 
O H P . 6* P 



iililillllll 



uooucJoSoBo 



S § a < 



2 5 

o 

H 

8 



H H H 



. . . u * 



£t3UHHHHfc«UU 



^ 4J 4) 

3 8 8 




ID dl HJ (D 1) 

nj tg nj hj u 

•H *H "H *H 

4J 4J 4J *J -U 

o o 
fa t> 



3 «0 



TO CO 0) > 
TO <XJ ^ «-H 



o c: 

6 



to to 
tJ ^ 
«o c: 
o 
c 



o 

. »a «o 

vu. 3 C 3 

_ tJ>--« tR TO Jq JU 

^ >, q TO 3 C 4) 



™ 2 S 52. K. S m 
TOTO»anjTOX5toD 4 Q J , O c> 5 




to 

rH 

4J CO 

a 

a ro 
o 

a) 

•H TO 

o 

tO -H 

a) -o 

TJ C 

•r4 -H 

4J 

o <o 

(1) -p 

^ O 

O D 

s « 

— w 

o <0 

00 U 
CO 

n 

o > 
z o 

rH 

a 

M >i 

O _ 

w -a 

CO <1) 
w 4J 

as 

4-> u 

C -h 

03 T3 

%5 

(0 

W 0) 

a> 

C to 
to 

<D -rH 

TO S 
4J . 

O a) 

TO O 
•H C 
TO 0) 

to tr 
a> 

to to 

3 
O 

O ra 
O J3 

o 

a o 

0) 4J 

a g 



o 



o 



4J a> 
o 

o c 

-P (U 

to & 
m a> 
(J) to 

Q) TJ 
M <U 
-P 

cn o 
G <U 

-r* rH 
M 0) 
CD CO 



3 X! 
C -P 

0) o 
U P 

c 

a) <-h 
^ ra 
tr o 
tu *H 

<D O) 



121 



CA 02283458 1999-09-28 



m 
o 



o 

m -rl 
"H -H 

rg id Oi 
y O m 

■8 ; i 

* m * 
«H -H . H 

CO f-< 

• 6 * 

o o - 

a -h 

id 4J 

3 * . .. 

o< o -a 

J) 'H >) 

a »w >a 
a a o 



kj. m *o m 



. CM 
r CN 



U £h H 




a 
•a 

to 




ggggiJgg 



u u a u u 



- - - 2*S $ & 
8888888 

5i 5i 5 c? o» tn o» 

u U U U U U U 



O O h h {J 

""ml 



8 8 8 8 8 

g 4J 4J g _ 

5* jj? *jf I? S 

UUHOOE-CJUO 



liiifiiiii 



»6 



p55S 



8 3 



id 




8SSS 



gg gg 

eggs 




>t IQ O N 

O) B -H -H 

a> hj m -a 

JJ O -H -H 

w id co jy 



rrj Q 

U M 
•H 

a w 

cd co 



O .d 

c 



5 

(J 

u 

H 

E-t 

O 



EH 

s 

cj 

Eh 
CJ 



-a 



.8 



E-i 

a h 

s 



5 5 



co O 
vO CJ 

og 

S3 

M CJ 

w g 



fO CO 

cd 

to 

•H U 
10 4-J 

•h e 

a to 
ra -H 
O 2 

d . 

•H CO 
0) 

CO CO 

cd fti 

T3 O 

•H 

■m in 

O 0) 

cu 5 

H O 

U rH 

2 >, 
_ X) 

— T3 
CO CD 
© 4J 

CJ 
H 

O TJ 

ss c 

Q 

m a> 

a !a 
w 

CO CO 

cy <o 



o 

4J -H 



r ° 

CO 

- TJ 
CJ) c 
ra 

^ u 

O CO 



M £ 

O 

co ci 

a u 

fO o 

CO CJ 

t M 

b: o 



g 

Q 

H 

w 

CO 



to 

^•^ 

CU ITJ 

4J 4J . 

fO CO Eh 

Jh" 

<U = ^ 

C CO o 

CU C 

cn 

CD ^ 
TJ Eh ^ O 

0) U 4J 



C2 



.H S 



(0 



ssefiiiiesifigieieeesfiss' 



o 



«o to 

•CJ fO 

O Cj 

«0 *0 



CO CO 

c; q 

CJ CJ 

•0 ^ 



to to to to 

■H *«H. ^ 

«0 <0 10 to 

•3 *? R q a 

<u cu oj <u ro q 

*H "*S *H **-| 4J Qj 

q c; c: q fa «m 

*H "1 •■H 14 »-H M OI 

vJ T3 *0 t> Cn J< 



co co 
ra q 



S "H tj -u 



M (U u 



•a m 

CO O 



- O 



OOcjOoddcJ o cj cj cj 



O O cj cj 



o 

CN 



ra trj 

rrj ^ 

§ s 

o ^ 

q <U 

rrj CJ 



0) -o 
9 OJ 



o u 

a cu 



o 

•H O 

«M » 

H jq 

a to 

a> Li 

a a 

to e 

I -H 

V) in 

a> a 



8, 5 

CO *J 

"D S 



4-1 VM 
CJ -H 



Oi CO 



O 
CO 



a> to 
a cu 

to X) 

i o 
to U 

a> a 

•H 

U C 
CD O 

a -h 

(0 4J 



H J3 

cu >i 
co J3 



CD CD 

C cj 
CD C 
CT> CD 

^ & 
U CD 
■U to 

CO 4-1 

c; ra 

O 4J 

U CO 
U xJ CD 

ra -d 

^3 4J 

T3 o CD 

S M 

rrj cj 

in c 

0) 

J5 - - T3 

4J O CD 

^ ^ C 

O »H 

4J Q r| 

H U 

CO CD 
HOtTj 

CD U C 

CO 3 

CD 

M CD 
CJ 

d 

CD 

•h d « 
)h tr 4-> 

CD CD rrj 
CO CJ 



O - 
U U CD 

*h o -a 

•S *" -H 

CO O 

CO TJ CD 

C d ^ 

O ra CJ 

■H 4J ^ 

4J co C 



U • 

•H CD 
'H CJ 

a d 

a cd 

re 3 
— cr 
CD 

4J CO 
CD <*H 

4J rj 



O 

mh 

cu 

TJ E-t CD 

■H x: 

4J ^ 4J 

o o 

CD »H 
MOO 

d o 



ra - 

a 

CD 

CO > 

d- o 

O X) 
•H « 

a 5 

a 4J 
d m 

•H O 



d 



| A 
CD 



J -o -a 

C CD d 

CD CJ 
CJ CD CD 

S co fl 

D* a> 

cd cd r- 

« jz: m 

4J 

CD 

x: o o 

4J Z , 



CD 

4-> CO O 
(fl Tj 4-1 

& S T3 
•H 4-> C 
CO CO -H 

d 

= = ro 
co u 
s >« 
^ U 4-> 
X} ra 

s y -3 

flop 

= < CP 

s o 

c M H 
O rrj 

= *w d 
K ra 
= co 

TJ CD 

= d *n 
S ra -h 

= 4J 4J 
CO O 
= CD • 

= S o 
= d 

£ Eh ra 



4J 

d 

CD 

§ 

r-t 

f 

CJ 

d i 

CD CD 
CD CO 

CD 

CD > 
t> CD 
rrj U 

jd 

CD 

8 ^ 

»H 

a cd 

CJ 



•Q co 



122 



CA 02283458 1999-09-28 



i 

■H 

-P 

cd 
o 

-rl 
4-4 
-H 



eSSSSSSaSS'SSSHSSSSSSSisSSS'?' 

o 
w 

CO 



O 
•rl 
4-1 
-H 

O 

0) 

cu 

CO 
l 

to 

O 
U 
O 

o 
o 

■2. 

I 

■u 

CO 

o 



ouuuoooo 



rl 6 

o o 

4-1 U 
4-1 

o> to 

<D ri 
0) 

«J 6 

M -H 
■P H 

CO Pi 



H 
H 

a 

s 




U B i» 



ISS99S3S88S 

s a % s s s * s s s 




igll 

mi 



U L> U CJ 
13 CD O O 



isissi 

(9 O O U O (9 



S 3 



o 

H 



u 

r- e-« 




<2 a: y 
u u o < 



iisasaitltl 




iissisiisssss 



Es is S 5 5 

Q V ^ y " 



88888811 
§ M § § § 



o 

I— I 
ro 



2 3 2 S < h < 

o 5 o O CJ u u 




5 3 < 5 < ^ 
D o u u u o 



*c<<<<<<<<<< < << <<<<<< 



H Eh M H * Eh id E 

8 8 ^ 3 8 S 3 I 

h ^ H H H h H I 

8 g 8 8 8 8 8 

§ 8 S 8 S B 8 ! 

w § S § I S S ! 

< < < rij rfj < < 



q 

O] 05 10 O 
0> to 3 3 3 E 
•H *H O U U O 
T3 "O -h -H *-< x; 

. H H 4J 4J U 

5 6 >i >i >■ *> 



to 



<0 

3 - - 

<jj 0) cd ai 

in M M H 



3 -H >H M -H ~H 7J 

h D, *o x; *H *H CD CD 0) 



q q q 

'H "H *H 

6 B § 



to 10 CO *H 

3 3 3 U 

CJ U U 3 

J "H. -H -H H 

to u -u -u U 
q ^ >, co 

* *s -5 *S 
q a a a*H 

3 O O O *H 
M M M a 

to a a-H 



M >H 

q q 

2 § 



CO 

O *H 
q +J 
»h X! 
H3 3 
St to 



to 

0) 

to u 

<b c 
q 0) 

&§• 

o cy 

-H O 0) 
•H q 4J 
o o u 

O 5 41 

r-i 

. . a> 

i-} CO 



r- u o D 
m <j r- Q 



G O Q U 
M E-i M H 



H 

I 



0) .. 

to w 

I Q> 

to u 

P c 

c cu 

0) 3 

o> tr 

Q) 

•O to 
0) 

O CD 

o e 

iH -H 

CP M 

co a 



O 
en 



u s c 

4J 4J s 



. _ o 

(1) H -H : 4-1 <U 

o a « T3 

ri I 8 ' I 5 

8 8 6 * 3 S 

t0 *H qi to O 

CU " 3 

CO CU 10 >- = C 

rH 4-> = S 

nj rj to s <o 

_LJ (J -r-i = 

X3 Cd •» CO 



C) CU T3 H 
Eh U *H 



123 



CA 02283458 1999-09-28 



Annex XIII: Strategy for the selection of the Staphylococcus 
genus -specific hybridization probe from tut 
sequ nces. 
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15 



20 



25 



30 



35 



S. aureus 

5. aureus 

S. aureus 

S. aureus 

S. aureus aureus 

5. aurlcularis 

S. capitis capitis 

S. caseolytlcus 

S. cohnll 

S. epldermldls 

S. haemolytlcus 

5. haemolytlcus 

S. haemolytlcus 

S. haemolytlcus 

S. homlnls 

S. homlnls 

S. homlnls homlnls 

S. homlnls 

S. homlnls 

S. lugdunensls 

S. saprophytlcus 

S. saprophytlcus 

S . saprophytl cus 

S, selurl sclurl 

S. warnerl 

S. warnerl 

S. warnerl 

S. warnerl 

B. subtllls 

E. coll 

L. monocytogenes 
Selected sequence 

Selected genus-specific 
hybridization probe: 



G 
G 
G 
G 
G 
G 
G 
G 
G 
G 
G 
G 
G 
G 
G 
G 
G 
G 
G 
G 

G 

G 

G 

G 

G 

G 

G 

G 

G 

G 



400 

G TTGAAATGTT 
TTGAAATGTT 
TTGAAATGTT 
TTGAAATGTT 
TTGAAATGTT 
TAGAAATGTT 
TAGAAATGTT 
TAGAAATGTT 
TAGAAATGTT 
TAGAAATGTT 
TAGAAATGTT 
TAGAAATGTT 
TAGAAATGTT 
TAGAAATGTT 
TAGAAATGTT 
TAGAAATGTT 
TAGAAATGTT 
TAGAAATGTT 
TAGAAATGTT 
TAGAAATGTT 
TAGAAATGTT 
TAGAAATGTT 
TAGAAATGTT 
TTGAAATGTT 
TAGAAATGTT 
TAGAAATGTT 
TAGAAATGTT 
TAGAAATGTT 
TTGAAATGTT 
TTGAAATGTT 
TAGAAATGTT 
GAAATGTT 



CCGTAAATTA 
CCGTAAATTA 
CCGTAAATTA 
CCGTAAATTA 
CCGTAAATTA 
CCGTAAATTA 
CCGTAAATTA 
CCGTAAATTA 
CCGTAAATTA 
CCGTAAATTA 
CCGTAAATTA 
CCGTAAATTA 
CCGTAAATTA 
CCGTAAATTA 
CCGTAAATTA 
CCGTAAATTA 
CCGTAAATTA 
CCGTAAATTA 
CCGTAAATTA 
CCGTAAATTA 
CCGTAAATTA 
CCGTAAATTA 
CCGTAAATTA 
CCGTAAATTA 
CCGTAAgTTA 
CCGTAAgTTA 
CCGTAAgTTA 
CCGTAAgTTA 
CCGTAAgcTt 
CCGcAAAcTg 
CCGTAAATTA 
CCGTAAATTA 



425 
TTAGA 
TTAGA 
TTAGA 
TTAGA 
TTAGA 
TTAGA 
TTAGA 
TTAGA 
TTAGA 
TTAGA 
TTAGA 
TTAGA 
TTAGA 
TTAGA 
TTAGA 
TTAGA 
TTAGA 
TTAGA 
TTAGA 
TTAGA 
TTAGA 
TTAGA 
TTAGA 
TTAGA 
TTAGA 
TTAGA 
TTAGA 
TTAGA 
CTTGA 
cTGGA 
CTAGA 
TT 



SEQ ID NO: 605 
GAAATGTT CCGTAAATTA TT 



SEQ ID NO: 
179 
176 
177 
178 
180 
181 
182 
183 
184 
185 
186 
189 
190 
188 
196 
194 
191 
193 
195 
197 
198 
200 
199 
201 
187 
192 
202 
203 

78 



The sequence numbering refers to the s+* n h v i~~~ 

fragment (SEQ ID NO- 179) 5^7° I ^ Staphylococcus aureus tuf gene 
the selected sequent or matcS tLt ^ Ca P itals identica? to 

indicated by lower easel sequence. Mismatches are 
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Annex XIV: 



Strategy for the selection of Staphylococcus 
saprophytics- *p cific and of Staphylococcus 
haemolyticus- sp cific hybridization prob s 
from tuf sequ nc s. 



10 



15 



20 



25 



30 



35 



40 



S. aureus 
S. aureus 
S. aureus 
S. aureus 
S. aureus aureus 
S. auricularis 
S. capitis capitis 
S. caseolyticus 
S. cohnii 
S. epidermidis 
S. haemolyticus 
haemolyticus 
haemolyticus 
haemolyticus 
hominis 

hominis hominis 
hominis 
hominis 
S. hominis 
S. lugdunensis 
S. saprophyticus 
S. saprophyticus 
saprophyticus 
sciuri sciuri 
warneri 
warneri 
via rneri 
warneri 
subtilis 
coli 

L . monocytogenes 
Selected sequences 



S. 

s. 
s. 
s. 
s. 
s. 
s. 



s. 
s. 
s. 
s. 
s. 
s. 

B. 
£. 



339 

AG TtGGTGAAGA 
TtGGTGAAGA 
TtGGTGAAGA 
TtGGTGAAGA 
TtGGTGAAGA 
TCGGTGAAGA 
AG TtGGTGAAGA 
AG TtGGTGAAGA 
AG TCGGTGAAGA 
AG TtGGTGAAGA 
AG TtGGTGAAGA 
AG TtGGTGAAGA 
TtGGTGAAGA 
TtGGTGAAGA 
TtGGTGAAGA 
TtGGTGAAGA 
TtGGTGAAGA 
TtGGTGAAGA 
AG TtGGTGAAGA 
AG TCGGTGAAGA 
AG TCGGTGAAGA 
AG TCGGTGAAGA 



AG 
AG 
AG 
AG 
AG 



AG 
AG 
AG 
AG 
AG 
AG 



AG TCGGTGAAGA 



TG 
AG 
AG 
AG 
AG 
AG 
AG 
AG 



TtGGTGAAGA 
TtGGTGAAGA 
TtGGTGAAGA 
TtGGTGAAGA 
TtGGTGAAGA 
TCGGTGAcGA 
TtGGTGAAGA 
TtGGTGAcGA 



AgTtGAAATC 
AgTtGAAATC 
AgTtGAAATC 
AgTtGAAATC 
AgTtGAAATC 
AgTtGAAATC 
AgTtGAAATC 
AgTtGAAATC 
AgTtGAAATC 
AgTtGAAATC 
AgTtGAAATC 
AgTtGAAATC 
AgTtGAAATC 
AgTtGAAATt 
AgTtGAAATt 
AgTtGAAATt 
AgTtGAAATt 
AgTtGAAATt 
AgTtGAAATt 
AgTtGAAATt 
AATCGAAATC 
AATCGAAATC 
AATCGAAATC 
AgTtGAAATC 
AgTtGAAATC 
AgTtGAAATC 
AgTtGAAATC 
AgTtGAAATC 
AgTtGAAATC 
AgTtGAAATC 
AgTaGAAgTt 



CGGTGAAGA AATCGAAATC 



ATcGGTtTaC 
ATcGGTtTaC 
ATcGGTtTaC 
ATcGGTtTaC 
ATcGGTtTaC 
ATcGGTATga 
ATcGGTATCC 
ATTGGTtTaa 
ATcGGTATgC 
ATcGGTATgC 
ATTGGTATCC 
ATTGGTATCC 
ATTGGTATCC 
ATTGGTATCa 
ATTGGTATCa 
ATTGGTATCa 
ATTGGTATCa 
ATTGGTATCa 
ATTGGTATCa 
ATTGGTATCC 
ATcGGTATgC 
ATcGGTATgC 
ATcGGTATgC 
ATcGGTtTaa 
ATcGGTtTaC 
ATcGGTtTaC 
ATcGGTtTaC 
ATcGGTtTaC 
AT cGGTcT tC 
gTTGGTATCa 
ATcGGTATCg 
• ATTGGTATCC 
A 



ATGACACaTC 
ATGACACaTC 
ATGACACaTC 
ATGACACaTC 
ATGACACaTC 
AaGACggTTC 
AcGAaACTTC 
cTGAagaacC 
AaGAagaTTC 
AcGAaACTTC 
ATGACACTTC 
ATGACACTTC 
ATGACACTTC 
AaGAaACTTC 
AaGAaACTTC 
AaGAaACTTC 
AaGAaACTTC 
AaGAaACTTC 
AaGAtACTTC 
AcGAtACTaC 
AaGAagaaTC 
AaGAagaaTC 
AaGAagaaTC 
cTGAagaaTC 
ATGACACTTC 
ATGACACTTC 
ATGACACTTC 
ATGACACTTC 
AaGAagagag 
AaGAgACTca 
AaGAagaaag 
ATGACACTTC 



383 SEQ 
TAA 
TAA 
TAA 
TAA 
TAA 
AAA 
TAA 
AAA 
CAA 
TAA 
TAA 
TAA 
TAA 
TAA 
TAA 
TAA 
TAA 
TAA 
TAA 
TAA 
CAA 
CAA 
CAA 
TAA 
TAA 
TAA 
TAA 
TAA 
AAA 
GAA 
AAA 



ID NO: 
179 
176 
177 
178 
180 
181 
182 
183 
184 
185 
186 
189 
190 
188 
194 
191 
193 
195 
196 
197 
198 
200 
199 
201 
187 
192 
202 
203 

78 



Selected species- 
specific hybridization 
45 probes : 



SEQ ID NO: 594 
ATTGGTATCC ATGACACTTC 

SEQ ID NO: 599 
CGGTGAAGA AATCGAAATC A 



fflho . nn refers to the Staphylococcus aureus tuf gene fragment (SEQ 
50 lv\rTlT SeotiLrfn r capi t a iS ar/iLntical to the selected sequence or 
50 matcS'tha? 'sequence. Mismatches are indicated by lower cases. 
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: Strategy for the sel ction of Staphylococcus 
aureus-specific and of Staphylococcus 
epidermidis-specific hybridization probes from 
tuf sequences . 



10 



15 



20 



25 



30 



35 



s. 
s. 
s. 
s. 
s. 
s. 
s. 
s. 
s. 
s. 

5. 

s. 
s. 
s. 
s. 
s. 
s. 
s. 
s. 
s. 

5. 

S. 

S. 

S. 

5. 
S. 
5- 
S. 
B. 
E . 
L . 
Se 
se 



aureus 
aureus 
aureus 
aureus 

aureus aureus 



auricularis 
capitis capitis 
caseolytlcus 
cohnll 
epidermldls 
haemolyticus 
haemolyticus 
haemolyticus 
haemolyticus 
homlnls 
homlnis 

homlnls hominis 
homlnls 
homlnls 
lugdunensls 
saprophy tlcus 
saprophytlcus 
saprophy tl cus 
sciurl scluri 
warnerl 
warnerl 
warnerl 
warnerl 
subtllls 
coll 

monocytogenes 
lected 
quences 



521 

TACACCACA 
TACACCACA 
TACACCACA 
TACACCACA 
TACACCACA 
TACACCACA 
CACACCACA 
TACtCCACA 
TACACCACA 
TACACCACA 
CACACCtCA 
CACACCtCA 
CACACCtCA 
TACACCtCA 
CACACCtCA 
TACACCtCA 
TACACCtCA 
TACACCtCA 
TACACCtCA 
TACACCtCA 
TACACCACA 
TACACCACA 
TACACCACA 
CACACCtCA 
TACACCACA 
TACACCACA 
TACACCACA 
TACACCACA 
CACtCCACA 
CAAgCCgCA 
TACtCCACA 
ACCACA 



TACTGAATTC 
TACTGAATTC 
TACTGAATTC 
TACTGAATTC 
TACTGAATTC 
cACTaAATTC 
cACTaAATTC 
TACTaAATTC 
cACaaAcTTt 
cACaaAATTC 
cACaaAATTt 
cACaaAATTt 
cACaaAATTt 
cACaaAATTC 
cACaaAATTC 
cACaaAATTC 
cACaaAATTC 
cACaaAATTC 
cACaaAATTC 
cACTaAATTt 
TACaaAATTC 
TACaaAATTC 
TACaaAATTC 
cACTaAATTC 
TACaaAATTC 
TACaaAATTC 
TACaaAATTC 
TACaaAATTC 
cAgcaAATTC 
cACcaAgTTC 
cACTaAcTTC 
TACTGAATTC 



547 
AAAGCAG. 
AAAGCAG. 
AAAGCAG . 
AAAGCAG . 
AAAGCAG. 
ActGCAG . 
AAAGCGG . 
AAAGCTG . 
AAAGCGG. 
AAAGCTG. 
AAAGCAG. 
AAAGCAG. 
AAAGCAG . 
AAAGCAG. 
AAAGCAG. 
AAAGCAG. 
AAAGCAG. , 
AAAGCAG. . 
AAAGCAG. . 
AAAGCTG . . 
AAAGCGG . . 
AAAGCGG. . 
AAAGCGG. . 
AAAGCTG. . 
AAAGCGG . . 
AAAGCGG. . 
AAAGCGG. . 
AAAGCGG. . 
AAAGCTG. . 
gAAtCTG . . 
AAAGCTG. . 
AAAG 



592 
. . TTCTTCtCa 
. . TTCTTCtC- 
. . TTCTTCtCa 
. . TTCTTCtCa 
. . TTCTTCtCa 
. . TTCTTCtCT 
► . TTCTTCAgT 
. . TTCTTCACT 
. TTCTTCAgT 
. TTCTTCACT 
. TTCTTCACa 
. TTCTTCACa 
. TTCTTCACa 
. TTCTTCACT 
. TTCTTCACT 
. TTCTTCACT 
. TTCTTCtCT 
. TTCTTCtCT 
. TTCTTCtCT 
. TTCTTCtCa 
. TTCTTCACT 
. TTCTTCACT 
. TTCTTCACT 
. TTCTTCACa 



AACTATCGtC 



SEQ 
617 
CACAATT 



ID NO 



AACTATCGtC 
AACTATCGtC 
AACTATCGtC 
AACTAcCGtC 
AACTAcCGCC 
AACTAcCGCC 
AACTATCGCC 
AACTATCGCC 
AACTATCGtC 
AACTATCGtC 
AACTATCGtC 
AACTATCGtC 
AACTATCGtC 
AACTATCGtC 
AACTATCGtC 
AACTATCGtC 
AACTATCGtC 
AACTAcCGCC 
AACTAcCGCC 
AACTAcCGCC 
AACTAcCGCC 
AACTAcCGCC 



CACAATT 
CACAATT 
CACAATT 
CACAATT 
CACAATT 
CtCAGTT 
CACAATT 
CACAATT 
CACAATT 
CACAATT 
CACAATT 
CACAATT 
CACAATT 
CACAATT 
CACAATT 
CACAATT 
CACAATT 
CACAATT 
CACAATT 
CACAATT 
CACAATT 
CACAATT 



. TTCTTCAgT 
. TTCTTCAgT 
. TTCTTCAgT 
. TTCTTCtCT 
TTCTTCAaa 
TTCTTCAac 
TTCACT 



AACTAcCGCC 
AACTAcCGCC 
AACTAcCGCC 
AACTAcCGtC 
ggCTAcCGtC 
AACTAcCGCC 
AACTATCGCC 



CACAATT 
CACAATT 
CACAATT 
CtCAGTT 
CgCAGTT 
CACAATT 
CACA 



179 
178 
176 
177 
180 
181 
182 
183 
184 
185 
186 
189 
190 
188 
195 
196 
191 
193 
194 
197 
198 
199 
200 
201 
192 
187 
202 
203 

78 



Selected species- 
specific hybridi- 
zation probes: 



SEQ ID NO: 585 
ACCACA TACTGAATTC AAAG 



SEQ ID NO: 593 
TTCACT AACTATCGCC CACA 
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lnn „ xvi -Strategy for the sel ction of the 

Staphylococcus homxnis-sp cific hybridization 
probe from tuf sequences. 



S. aureus 

S. aureus 

S. aureus 

S. aureus 

S. aureus aureus 

S. auricularis 

S. capitis capitis 

S. caseoXyticus 

S. cohnii 

S. epidermidis 

S. haemolyticus 

S. haemolyticus 

S. haemolyticus 

S. haemolyticus 

S. hominis 

S. hominis 

S. hominis hominis 

S. hominis 

S. hominis 

S. lugdunensis 

S. saprophytics 

S. saprophyticus 

S. saprophyticus 

S. sciuri sciuri 

S. warneri 

S. warneri 

S. warneri 

S. warneri 

B. subtilis 

E. coli 

L. monocytogenes 
Selected sequence 

Selected species-specific 
hybridization probe: 



358 

ATC ATcGGTtTac 
ATC ATcGGTtTac 
ATC ATcGGTtTac 
ATC ATcGGTtTac 
ATC ATcGGTtTac 
ATC ATcGGTATgA 
ATC ATcGGTATCc 
ATC ATTGGTtTaA 
ATC AT cGGTATgc 
ATC AT cGGTATgc 
ATC ATTGGTATCc 
ATC ATTGGTATCc 
ATC ATTGGTATCc 
ATT ATTGGTATCA 
ATT ATTGGTATCA 
ATT ATTGGTATCA 
ATT ATTGGTATCA 
ATT ATTGGTATCA 
ATT ATTGGTATCA 
ATT ATTGGTATCc 
ATC ATcGGTATgc 
ATC ATcGGTATgc 
ATC ATcGGTATgc 
ATC ATcGGTtTaA 
ATC ATcGGTtTac 
ATC ATcGGTtTac 
ATC ATcGGTtTac 
ATC ATcGGTtTac 
ATC ATcGGTcTtc 
ATC gTTGGTATCA 
GTT AT cGGTATCg 
ATTGGTATCA 



383 

AtGAcACaTC TAA 
AtGAcACaTC TAA 
AtGAcACaTC TAA 
AtGAcACaTC TAA 
AtGAcACaTC TAA 
AAGAcggTTC AAA 
AcGAAACTTC TAA 
ctGAAgaacC AAA 
AAGAAgaTTC CAA 
AcGAAACTTC TAA 
AtGAcACTTC TAA 
AtGAcACTTC TAA 
AtGAcACTTC TAA 
AAGAAACTTC TAA 
AAGAtACTTC TAA 
AAGAAACTTC TAA 
AAGAAACTTC TAA 
AAGAAACTTC TAA 
AAGAAACTTC TAA 
AcGAtACTaC TAA 
AAGAAgaaTC CAA 
AAGAAgaaTC CAA 
AAGAAgaaTC CAA 
ctGAAgaaTC TAA 
AtGAcACTTC TAA 
AtGAcACTTC TAA 
AtGAcACTTC TAA 
AtGAcACTTC TAA 
AAGAAgagag AAA 
AAGAgACTca GAA 
AAGAAgaaag AAA 
AAGAAACTTC 



SEQ ID NO: 59*7 
ATTGGTATCA AAGAAACTTC 



SEQ ID NO: 

179 

176 

177 

178 

180 

181 

182 

183 

184 

185 

186 

189 

190 

188 

196 

194 

191 

193 

195 

197 

198 

200 

199 

201 

187 

192 

202 

203 

78 



tuf" gene fragment 
selected 



r,rr rpfers to the Staphylococcus aureus tuf 
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annex XXI = ^"^ubi^tou, primere £or DNA a„ pl i £i e ation 



SEQ ID NO. 



Nucleotide sequence 



Originating DNA fragment 



SEQ ID 
NO. 



Nucleotide 
position 



919 
920° 

921 
922 c 



935 
936 



937 
938 



Universal primers (recA) 

5 • -GGI CCI GAR TCI TMI GGI AAR AC 
5 » -TCI CCV ATI TCI CCI TCI AIY TC 

5 ' -TIY RTI GAY GCI GAR CAI GC 

5 ' -TAR AAY TTI ARI QCI YKI CCI CC 

Universal primers (radSl) 

5 ' -GGI AAR WSI CAR YTI TGY CAY AC 
5 1 -TCI SIY TCI GGI ARR CAI GG 

Universal primers (dxacl) 

5 » -ATI ACI GAR GYI TTY GGI GAR TT 
5'-CYI GTI GYI SWI GCR TGI GC 



918 
918 

918 
918 



93 9 
93 9 



940 
94 0 



437-459 
701-723 

515-534 
872-894 



568-590 
1126-1145 



1038-1060 
1554-1573 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: Miche i G \ Quebec City 

(i ) APPLICANTS: gQ^gsx^OT, Maurice , st-Augustin-de-Desmaures 

S^Ch^ia^'l^ert-de-Levis 
OUELLETTE, Marc 1 , Quebec City 
PICARD, Frangois J. 1 , Cap-Rouge 
ROY, Paul H . 2 , Loretteville 



45 



50 



55 



^Canadian citizenship 
American citizenship 



tt^xtt 1 t o"kt p t HHTiY CONSERVED GENES AN THEIR USE 
(ii) TITLE OF THE IJ™™L ^Tc G SjS SPECIFIC AND UNIVERSAL 
TO GENERATE SPECIES- S ^ I "m LI 5^ Tia H PRIMERS TO RAPIDLY 

PATHOGENSFROM CLINICAL SPECIMENS FOR DIAGNOSIS 



(iii) NUMBER OF SEQUENCES: 940 
(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: 

(B) STREET: 

(C) CITY: 

(D) STATE: 

(E) COUNTRY: 

(F) ZIP: 

1 (v) COMPUTER READABLE: 

(A) MEDIUM TYPE: 

(B) COMPUTER: 

c (C) OPERATING: 

(D) SOFTWARE: 

(vi) CURRENT APPLICATION DATA: 

0 (A) APPLICATION: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(Vii) PRIOR APPLICATION DATA: 

(A) APPLICATION: 

(B) FILING DATE: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME : 

(B) REGISTRATION NUMBER: 
(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE : 

(B) TELEFAX: 
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2) INFORMATION FOR SEQ ID NO: 1 

(i) SEQUENCE CHARACTERISTICS : 
5 (A) LENGTH: 750 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter baumannii 

(B) STRAIN: ATCC 19606 

15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 

CAAACTCGTG AGCACATCCT TCTTTCTCGT CAGGTAGGTG TACCTTACAT *n 

CATCGTATTC TTAAACAAAT GCGACCTTGT TGATGACGAA GAATTACTTG inn 

20 AATTAGTAGA AATGGAAGTA CGTGAACTTC TTTCTACTTA TGACTTCcS tin 

^SI GACA CTCCAG ™" CCGTGGTTCA GCTCTTGCAG CG^SSSS 200 

TGAAGCTGGT CCTTACGGTG AAGAATCAGT TCTTGCTCTT GTAGCAGCAC lln 

TTGACTCTTA CATCCCAGAG CCAGAGCGTG CAATCGACAA AGCATTCTTG Inn 

ATGCCAATCG AAGACGTATT CTCAATTTCT GGTCGTGGTA CAGTAg5aI? \ln 

25 AGGCCGTGTT GAAGCTGGTA TCATCAAAGT TGGTGAAGAA GTAGaSt^G III 

ttggtattaa agatacagtt aaaacaactg taactggcgt agaIStctt? t in 

CGTAAACTTC TTGACGAAGG CCGTGCAGGT GAGAACTGTG GTATCTtSS tnn 

tcgtggtact aagcgtgaag aagtacaacg tggtcaagta cttgc?SS tin 

^ ^? GTACAAT CAAGCCGCAC ACTAAATTCG ACGCAGAAGT aSSSctS 

30 TCTAAAGAAG AAGGTGGTCG TCACACTCCA TTCTTAAATG GTTACCctS 122 

ACAGTTCTAC TTCCGTACAA CTGACGTAAC TGGTGCRATC SgTtSSg S 

aaggcgttga aatggtaatg ccaggtgaca acgttgaaat gtcaSSSa5 150 



35 



45 



2) INFORMATION FOR SEQ ID NO : 2 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 826 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



( Vi ) ORIGINAL SOURCE : 

(A) ORGANISM: Actinomyces meyeri 

(B) STRAIN: ATCC 35568 



50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 

nn^Z^S, CTCG TGGTCG CCGCGACCGA CGGCCCCATG GCCCAGACCC 

GCGAGCACGT CCTGCTCGCC CGTCAGGTCG GCGTTCCCAC CATCCTCATP ■■ ™ 

n^n^ CA AGTCC °ACAT GGTTGACGAC GAGGAAATGA T^AaSS? JSS 

55 cgaggaggag tgccgcgacc tgctggagtc ccaggacttc rSrS i 50 

ccccgatcgt ccaggtttcc gctctgaagg ccctcgaSg SSSr^IS ^ 00 

ssssss ss sags ™l Ulii in 

co SSSSE 3SSES ssi? ™£S 2SS 

450 
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CCAGAAGACC ACGGTCACCG GCATCGAGAT ? £CCACAAG TCGATGGACG 
AGGCATGGGC CGGCGAGAAC TGTGGCCTGC TGCTG CCATCACGCC 
GATGAGGTTG AGCGCGGCCA GGTTGTGGCC JTT^ GAAGAGGGCG 

T SSSSiS SSSSSro ^cgaISSc GTCCGCAGTT CTACTTCCGT 
GCCGTCACAA CCCGTTCTTC CCCGAGGGCA CCGACATGGT 

S C S S2SSS ScCGT TCAGCTGATC CAGCCCATCG 

CCATGGAGCC CGGCTGGGCT TCGCCA 



10 



2) INFORMATION FOR SEQ ID NO: 



(i) SEQUENCE CHARACTERISTICS: 
15 ( A ) LENGTH: 835 bases 

( B ) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: . 

(A) ORGANISM: Aerococcus vindans 

(B) STRAIN: ATCC 11563 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 

rAACGTGACA ACGACAAACC ATTCATGATG CCAATTGAGG 

oc IS°£*iS£ aa^SSSgt CGTGGTACTG ttgctacagg tcgtgttgaa 

40 S25SSSS ifgSS^ ™™ 

AAGGTGGACG TCATACACCA TTCTTAACTA AAGACGTAGC 

SSSSS SSSSSm SSSt^aa ttgattcacc 

CAGTTGCGAT CGAAGATGGT ACTAAATTCT CTATC 



45 



50 



55 



60 



2 ) INFORMATION FOR SEQ ID NO: 4 

( i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: . , _ a , lKan 

(A) ORGANISM: AchroinoJbacter xylosoxidans subsp 

denitr-ificans 

135 
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(B) STRAIN: ATCC 15173 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 



CCTGGTGGTG TCGGCCGCTG ACGGCCCGAT GCCGCAAACG CGCGAACACA 
TCCTGCTGAG CCGCCAGGTT GGCGTGCCGT ACATCATCGT CTTCCTGAAC 
AAGGCCGACA TGGTTGACGA CGCCGAGCTG CTTGAGCTGG TGGAAATGGA 
AGTTCGCGAR CTGCTGAGCA AGTACGACTT CCCGGGCGAC GACACCCCGA 



50 
100 
150 

_ ■ - • _ - — — w * * * * w ,i, x ucuouuttjHu ."CuA pnn 

TCGTGAAGGG TTCGGCCAAG CTGGCGCTGG AAGGCGACAA GGGCGAAC TG 250 
10 GGCGAACAGG C CAT CATGGC GCTGGCCGCT GCGCTGGACT CGTACATCCC 3 on 

350 
400 

GGTGAAGACG ACCTGCACGG GCGTGGAAAT GTTCCGCAAG CTGCTGGACC snn 
15 SiSSSJSS CGGCGACAAC GTGGGCATCC TRCTGCGCGG CACCAAGCGT 550 

650 

ACGACGGACG TGACGGGCAC GATCGAGCTG CCGGCCGACA AGGAAATGGT 7^n 
2 0 CCTGCCGGGC GACAACGTGG CCATGACGGT CAAGCTGCTG GCTCCg1t2S loo 
CCATGGAAGA AGGCCTGCGT TCGCCAC «-ltUATCG 800 

82 V 



— — ww-. www>_w>_.l vjcul x uu/iL 1 UlaxAUATUUU 

GACGCCTGAG CGTGCCGTTG ACGGCGCGTT CCTGATGCCG GTTGAAGACG 
TGTTCTCGAT CTCGGGTCGC GGCACCGTGG TGACCGGCCG TATCGAACGC 
GGCATCATCA AGGTCGGCGA GGAAATCGAA ATCGTCGGTC TGGTGCCGAC 



w*wws_#wrtj.ww i^l. x ou<otULjo L/iL L-AACj CJ U T 

GAAGACGTCC AGCGCGGCCA GGTTCTGGCC AAGCCGGGCT CGATCACCCC 
GCACACGGAC TTCACGTCCG AGGTGTACAT CCTGTCCAAG GAAGAAGGCG 
GCCGTCACAC TCCGTTCTTC CAAGGCTATC GTCCCCAGTT CTACTTCCGC 



2 5 2) INFORMATION FOR SEQ ID NO: 5 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 
3 ° (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Anaerorhabdus furcosus 

(B) STRAIN: ATCC 25662 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 



TGGATCAATC CTAGTAGTTG CTGCAACTGA TGGACCAATG CCTCAAACTC 
S^™ T CTTACTTGCT CGTCAAGTAG GTGTTCCAAG AATGGTTGTA 
TTCTTGAACA AATGCGACAT GGTTGAAGAT GAAGAATTAA TCGACCTTGT 
TGAAATGGAA GTTCGTGAAC TTCTAAGTGC TTACGGTTTC GAAGGTGATG 
45 ATACACCAGT TATCCGTGGT TCTGCATTAA AATCTCTTGA AGGAAATGCT 
GATTGGGAAG CAAAAGTTGC TGAATTAATG GATGCAGTTG JSSJJSS? 
JS^TCCA ACTCATGAAA CAGACAAACC ATTCTTAATG GCTGTTgSg 
ATGTATTCAC AATTACAGGT CGTGGTACAG TTGCTACTGG ACrTCTTraa 

cgtggacact taaaccttaa cgaagaagtt ga^tcg??g SSSSSSf 
so tactaagaaa tcagttgtta ctggtatcga aatgttccg? SSSStaS 
actatgctga agcaggagac aacattggtg cattattacg tototttc? 

CGTGATGAAA TCGAACGTGG ACAATGTCTA GCTAAACCTG 
TCCACATACA gctttcaaag CTCAAGTATA CGTATTAACT ^^gaag 
GTGGACGTCA TACAC CATTC GTAACTAACT ACCGTCCtS ATT^otS 
55 ^ ACAACTG ACGTAACAGG AGTTGT TAAA CTTCCTGAAG qSctSSS 

ggttatgcct ggagacaaca tcgaaatgat cgttgaatta atcg??^ Itn 

TCGCTGTTGA ACAAGGAACT AAG AitbLlLUA 800 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 



823 



60 
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10 



15 



2) INFORMATION FOR SEQ ID NO: 6 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

'^r^S-'Bacillus anthracis 

(B) STRAIN: 4229 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 
CGGCGGTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCTCAAACTC 

gtgagcacat ccttctttct cgtcaagtag gtgtacctta CA 

TTCTTAAACA AATGCGACAT GGTAGACGAC CCAGGCGACG 
2 0 AGAAATGGAA GTTCGCGACC TATTATCTGA ATA ^ AGGAGAAGCT 

ACATTCCTGT AATCAAAGGT TCTGCTCTTA TQ ATGCT TACAT 

GATTGGGAAG CAAAAATCAT TGAAllAAi^ ^ CCTGTAGAGG 
CCCAACTCCA GAACGTGAAA CTGACAAACC ATTCTTAATG 
ACGTATTCTC TATCACAGGT CGTGGTALAO GTCTTGCTGA 
25 CGCGGTATCG TTAAAGTTGG TGACGTAGTA g^^g CGTAAACTTC 
AGAAAATGCT TCTACAACTG TAACTGGTGT AGAGATGT^ tcgtggggtt 
TTGACCAAGC TCAAGCTGGA GACAACATC^ GCGGTTCTGT 
GCTCGTGAAG ACATCCAACG TGC^CAAGTA TTTCGTATTA TCTAAAGAAG 
AAAAGCTCAC GCTAAATTCA AAGCTGAAGT ^CGTA TCAGTTCTAC 
30 AAGGTGGACG TCACACTCCA ™TTCOCTA A^ Q ^^ctca 

SSSSSo cSS acScgaaat GACTATCGAA CTTATCGCTC 
CAATCGCTAT CGAAGAGGGA ACTAA 



2) INFORMATION FOR SEQ ID NO: 7 

(i) SEQUENCE CHARACTERISTICS: 
4n (A) LENGTH: 829 bases 

* u / B ) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 14579 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 
CGGCGGTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCTCAAACAC 

™E iiii IS S 

AGAAATGGAA GTTCGCGACC TATiAi^ aaqctCTTCA AGGAGAAGCT 
ACATTCCTGT AATCAAAGGT TCTGCTCTTA ^ ATGCTTACAT 
GATTGGGAAG CAAAAATCAT JGAATTAA1£ ^ CCTGTAGAGG 
35SE2E ^CTACTGG TCGTGTTGAG 



50 



55 



50 
100 
150 
200 
250 
300 
350 
400 
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500 
550 
600 
650 
700 
750 
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825 
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CGCGGTATCG TTAAAGTTGG TGACGTAGTA GAAATCATCG GTCTTGCTGA 
AGAAAATGCT TCTACAACTG TAACTGGTGT AGAGATGTTC CGTAAACTTC 
TTGACCAAGC TCAAGCTGGA GACAACATCG GTGCTTTACT TCGtc£S?S 
GCTCGTGAAG ACATCCAACG TGGACAAGTA CTTGCAAAAA gSottSS 
i£*S£S$F GCTAAATTCA AAGCTGAAGT TTTCGTATTA ScSSEES 
AAGGTGGACG TCACACTCCA TTCTTCGCTA ACTACCGTCC TCAGTTCTAC 
TTCCGTACAA CTGACGTAAC TGGTATCATC CAATTACCAG AMoSSS? 
AATGGTAATG CCTGGTGACA ACATTGAAAT GACTATCGAA CTTATCGCTC 
CAATCGCTAT masr.ir.p.p.s * * « ,T. m „ lAILW-it. 



CAATCGCTAT CGAAGAGGGA ACTAAATTC 



2 ) INFORMATION FOR SEQ ID NO: 8 

> (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides distasonis 

(B) STRAIN: ATCC 8503 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 

CGGTGCTATC ATCGTAGTTG CTGCTACTGA TGGTCCTATG CTTriaiPTP 
G CGAGCACAT CCTTTTGGCT CGTCAGGTAA ACGTTCCGAG ATtoS5?S5a 
^CATGAACA AGTGTGACAT GGTTGACGAC GAGGAAATGT TGgSSgOT 
TGAGATGGAG ATGAGAGAGT TGCTTTCATT CTATCAATTC QAOGPtASI 
ACACTCCGAT CATCCGTGGT TCTGCTCTTG GTOoSKS SSSSESS 

ataaagtaat ggagttgatg gaagcttgtg 
I^tctgcct ccgcgcgaaa tcgacaagcc gttcttgatg ccgg??Sg^ 

ACGTATTCTC AATCACGGGT CGTGGTACTG TTGCTACAGG TCrTarS^ 
JE55£ ATTO TTAAGGTTGG TGAGGAAGTT StcSSSS? 
^TGGTAAG AAATCTGTTG TTACAGGTGT TGAGATGTTC CGTaIct5a? 
n?%t^~ G TGAG °CTGGT GATAACGTTG GTTTGTTGCT TCGC^Irc 
GATAAGAATG AGATCAAGCG TGGTATGGTA ATCTGCCACC CGGPTrirrT 
TAAAGAGCAT TCTAAGTTCA AGGCTGAGGT TTATATCTTG f^SSS 

ss sssss? sssas sill SE 

S55SSS ?=§ TA ACG ™ ? SSS^SSS SSSSSS 



2) INFORMATION FOR SEQ ID NO: 9 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 639 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus cassellflavus 

138 
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(B) STRAIN: R763 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 

PPTPrTtTGC CTCAAACACG TGAACACATC TTGTTATCAC GTAACGTTGG 
^CCTATGC JTCAAA^ AATGGATATG GTTGATGACG 

SSSKct agaStIgtt GAAATGGAAG TTCGTGACTT attgtcagaa 

AAGAATTACT AbAA^i aTCGCTGGTT CTGCTTTGAA 

iEE Sii S3 sssss sees 
liii S sssss = ss 
S Si ssss ssssss = 

^SScta 2gtggtgttg ctcgtgaaga catccaacgt ggacaagtat 
?g§c?SSg? ?gg?acaatc acacctcata caaaatttaa agctgaagtt 
?a?g?^S Saaagaaga aggtggacgt cacacacca 



20 



2) INFORMATION FOR SEQ ID NO: 10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 692 bases 
2 c (B) TYPE: Nucleic acid 

( C ) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

lVi, X IN ^"^ Staphylococcus saprophytic- 

(B) STRAIN: CSG 197 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 

^.n^iTTr TTTTATCACG TAACGTTGGT GTTCCAGCAT TAGTTGTATT 50 

JSSSSE SSS^S S TACCGCCCAC VggSSZ 

go -SESS SES5SSS SSSSSS SSSSSS ST 3 *" .» 



2) INFORMATION FOR SEQ ID NO: 11 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 808 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

139 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides vnlgatus 

(B) STRAIN: ATCC 8482 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 



10 TTATTGTTTG 
ATCCTGTTGG 
CAAGTGTGAT 
AAATGCGTGA 
ATTATTCGTG 

15 AGATAAAGTA 
CTCCGCGTGA 
TCTATCACTG 
TATC CATGTA 
AATCAGTTGT 

2 0 GAAGCTGGTG 
AATCAAACGT 
CTAAGTTCAA 
CATACTCCGT 
GGAC TGTACA 

25 CTGGTGATAA 
AATGTAGG 



TGCTGCAACT 

CTCGTCAGGT 

ATGGTTGATG 

ATTGCTTTCA 

GTTCTGCATT 

ATGGAGTTGA 

TATTGATAAA 

GTCGTGGTAC 

GGTGACGAAG 

TACTGGCGTT 

ATAATGTTGG 

GGTATGATTT 

AGCTGAGGTT 

TCCATAACAA 

GGTGAAATCA 

CGTAACTATT 



GATGGTCCGA 

AAACGTACCT 

ATGAAGAAAT 

TTCTATGATT 

AGGTGCTTTG 

TGGATGCAGT 

CCTTTCTTGA 

TGTTGCTACA 

TTGAAATTCT 

GAAATGTTCC 

TTTGTTGTTG 

TGTGTAAGCC 

TATATCTTGA 

GTATCGTCCT 

CTTTGCCGGA 

ACAGTTGAGT 



TGC CTCAGAC 
CGTTTGGTTG 
GTTGGAATTG 
TCGATGGTGA 
AATGGTGTTC 
TGATACTTGG 
TGCCTGTTGA 
GGTCGTATCG 
TGGTTTGGGT 
GTAAATTATT 
CGTGGTATCG 
GGGTCAGGTT 
AGAAAGAAGA 
CAGTTCTACT 
AGGAACTGAA 
TGATCTACCC 



TCGTGAACAC 50 

TTTTCTTGAA 100 

GTAGAAATGG 15 o 

TAATACTCCT 2 00 

CTCAATGGGA 250 

ATTCCATTGC 3 00 

AGACGTGTTC 3 50 

AAGCTGGTAT 4 00 

GAAGATAAGA 450 

GGATCAAGGT 500 

ACAAAAACGA 550 

AAGGCTCACT 600 

AGGTGGTCGT 650 

TGCGTACTAT 700 

ATGGTAATGC 750 

GGTTGCATTG 800 
808 



3 0 2) INFORMATION FOR SEQ ID NO: 12 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 838 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bartonella henselae 

(B) STRAIN: ATCC 49882 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 

TGGTGCGATT TTGGTTGTTT CAGCTGCTGA TGGTCCGATG CCTCAAACAC 
GTGAGCATAT TCTTCTTGCC CGTCAGGTTG GTGTTCCAGC ^TTGTTGt5 
TTTCTTAATA AGGTTGATCA GGTTGATGAT GCTGAGCTTT TGGAGCTTGT 
TGAGCTTGAA GTTCGGGAGT TATTGTCGAA ATATGATTTT CCAGGAGACG 

50 ATATTCCGAT CGTTAAAGGT TCTGCTTTGG CAGCGCTTGA AGAtSSa? 
^GCATTG GTGAAGATGC GGTTCGTCTT TTGATGAGTG AACT^?^ 
TTATATACCG ACGCCTGAAC gtcctgttga TCAGCCGTTT TTGATGCCAA 
TTGAAGATGT TTTTTCGATT TCGGGTCGTG GAACTGTTGT GACGGGTCGT 
GTTGAGCGTG GTGTTATTAA GGTTGGTGAA GAAGTTGAGA TTAtSgS? 

55 TCGTCCAACT TCTAAGACAA CAGTTACAGG GGTTGAAATG TTC^SSS? 
TTTTAGATCA GGGGCAAGCG GGTGATAATA TTGGAGCGCT GCTt2SSS? 
^TGATCGTG AAGGGATTGA GCGTGGACAA GTTTTGGCGA AOCCTOOT? 

ggttacacct catacgagat ttaaagcaga ggcttacatt ttgacgSIg 

ATGAAGGTGG TCGTCATACT CCATTTTTCA CGAATTATCG TCCTCAGTTT 
60 TATTTCCGTA CTACGGATGT AACGGGAATT GTTACGCTTC SSSSotS 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
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AGAGATGGTT ATGCCTGGTG ATAATGTTGC ™GGATGTG TCTCTGATTG 800 
TTCCAATTGC CATGGAAGAA AAACTTCGTT TTGLlAiu 



2) INFORMATION FOR SEQ ID NO: 13 

( i ) SEQUENCE CHARACTER I STICS : 

(A) LENGTH: 83 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

1 (vi ) ORIGINAL SOURCE: 

(A) ORGANISM: Bifidobacterium adolescentxs 

(B) STRAIN: ATCC 15703 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 

TGGCGCCATC CTTGTTGTGG CCGCCACCGA CGGCCCGATG GCTCAGACCC 
^SrSS GCTGCTCGCT CGTCAGGTGG GCGTCCCGAA GATCCTCGTC 
ag?gcStat GGTCGACGAC GACGAGCTCA tcgagctcgt 
rTrrGTGACC TCCTCGACGA AAATGGCTTC GATCGCGATT 

25 gcc?gS?cat cScg?g?cc SSSSooq CACTGCACGA tgacgctccg 

rRrCACGAGA AGTGGGTTGA GCAGATCAAG AAGCT C ATGG ACGCCGTCGA 
tcaSaStc CCGACCCCGG TCCACGATCT GGACAAGCCG TTCCTGATGC 
o^^IaAa TPTCTTCACC ATCTCCGGCC GTGGCACCGT GGTGACCGGC 

30 SSSSJX g?gg?aSc? SSStcaac tccaacgtcg agatcgtcgg 
Stccgtccg acccagacca ccaccgtcac ctccatcgag accttccaca 

SSSXSS CGAGTGCGAG GCTGGCGACA ACACCGGTCT GCTGCTCCGC 
SgS?cSS gtgacSggt CGAGCGTGGC CAGGTTCTGG CTGCTCCGGG 
CtSotgIS CCGCACACCA AGTTCGAGGG CGAAGTCTAC GTGCTGACCA 
iSfcWT CGGCCGTCAC TCGCCGTTCT TCTCCAACTA CCGTCCGCAG 
35 otSccaccga CGTCACCGGC GTCATCACCC TGCCGGAAGG 

SSSSES SSESkSm GCGATCACGC TACCTTCGGC GTTGAGCTGA 
tcSgcSgat CGCTATGGAA gagggcctga ccttcgcag 



40 

2 ) INFORMATION FOR SEQ ID NO: 14 

( i ) SEQUENCE CHARACTERISTICS : 
45 (a) LENGTH: 839 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bifidobacterium dentium 

(B) STRAIN: ATCC 27534 



55 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 

TGGCGCTATC CTCGTTGTGG CCGCCACCGA CGGCCCGATG GCTCAGACCC 50 
gSaGCACG? gStGCTCGCT CGTCAGGTGG GCGTGCCGCG TATCCTCGTC 100 
GCCCTGAACA AGTGCGATAT GGTCGACGAC GAAGAGCTCA TCGAGCTCGT 150 

141 
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TGAGGAAGAG GTCCGTGACC TCCTCGACGA AAACGGCTTC GATCGCGATT 2 00 

GCCCGGTCAT CCACACCTCC GCCTACGGCG CGCTGCACGA TGACGCTCCG 250 

GACCACGACA AGTGGGTTGA GTCCGTCAAG GAACTCATGA AGGCCGTCGA 300 

CGAGTACATC CCGACCCCGA CCCACGATCT GGACAAGCCG TTCCTGATGC 350 

5 CGATCGAAGA TGTGTTCACC ATCTCCGGCC GTGGCACCGT GGTTACCGGC 4 00 

CGTGTCGAGC GTGGTAAGCT CCCGGTCAAC TCCAACGTTG AGATCGTCGG 4 50 

CATCCGTCCG ACCCAGACCA CCACCGTCAC CTCCATCGAG ACCTTCCACA 500 

AGCAGATGGA CGAGTGCGAG GCTGGCGACA ACACCGGTCT GCTGCTCCGC 550 

GGCATCAACC GTGACCAGGT CGAGCGTGGC CAGGTTCTGG CTGCTCCGGG 600 

10 CTCCGTGACC CCGCACACCA AGTTCGAGGG CGAAGTCTAC GTGCTGACCA 650 

AGGACGAAGG CGGCCGTCAC TCGCCGTTCT TCTCCAACTA CCGTCCGCAG 700 

TTCTACTTCC GTACCACCGA CGTCACCGGC GTCATCACCC TGC CGGAAGG 750 

CGTTGAGATG GTGCAGCCGG GCGATCACGC TACCTTCGGC GTTGAGCTGA 8 0O 

TCCAGCCGAT CGCTATGGAA GAGGGCCTGA CCTTCGCAG 839 

1 5 



2) INFORMATION FOR SEQ ID NO: 15 

2 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brucella abortus 
30 (B) STRAIN: S2308 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 

TGGCGCGATC CTGGTGGTTT CGGCTGCTGA CGGCCCGATG CCGCAGACCC 
35 GCGAGCACAT CCTGCTTGCC CGTCAGGTTG GCGTTCCGGC GATCGTCGTG 
TTCCTCAACA AGTGCGAC CA GGTTGACGAT GCAGAACTGC TCGAACTGGT 
TGAACTGGAA GTGCGCGAAC TTCTGTCGAA GTACGAATTC CCCGGCGACG 
AAATCCCGAT CATCAAGGGC TCGGCTCTTG CTGCTCTGGA AGATTCTTCC 
AAGGAACTGG GCGAAGATGC CATCCGCAAC CTGATGGACG CGGTTGACAG *nn 
40 CTACATTCCG ACCCCGGAAC GCCCGATCGA CCAGCCGTTC CTGATGCCGA 



TCGAAGACGT GTTCTCGATC TCCGGCCGTG GTACGGTTGT GACGGGTCGC 
GTTGAGC GCG GTATCGTTAA GGTCGGTGAA GAAGTTGAAA TCGTCGGCAT 
CAAGGCGACG ACGAAGAC CA CGGTTACCGG CGTTGAAATG TTCCGCAAGC 



50 
100 
150 
200 
250 



350 
400 
450 

TGCTCGACCA GGGCCAGGCT GGCGACAACA TTGGCGCGCT GATCCGCGGC IS!! 
45 GTTGGCCGTG AAGACGTTGA ACGCGGCCAG GTTCTCTGCA AGCCGgSttP Inn 
TGTGAAGCCG CACACCAAGT TTAAGGCAGA AGCCTATATT SSSS^ tin 
ACGAAGGTGG CCGTCATACG CCGTTCTTCA CCAACTACCG TCCGCAOT?? ion 
CGACGGACGT GACGGGTGTT GTGACGCTTC CGGCTGGCAC 75S 
cn GGAAATGGTC ATGCCTGGCG ATAACGTCGC CATGGACGTT ACCCTGATCG 8 00 

50 TGCCGATCGC CATGGAAGAG AAGCTTCGCT TCGCTATC A ^ CiCjATCG 8 00 



2) INFORMATION FOR SEQ ID NO: 16 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 771 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



142 
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(ii) MOLECULE TYPE: Genomic DNA 
(vi)ORI G INAL^URCE: Burjdioider . a 

( B ) STRAIN: LSPQ 2217 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
10 GGCAGCAGAC GGCCCGATGC CGCAAACGCG TGAGCACATC CTGCTGGCGC 
GTCAGGTTGG CGTTCCGTAC ATCATCGTGT iC TTCGCGAACT 
GTGGACGACG C C G AACTGCT CGAGCTGGTC GA GTGAAGGGTT 
CCTGTCGAAG TACGACTTCC CGGGCGAC^A U£ CGAAGTGGCG 
CGGCGAAGCT GGCGCTGGAA GGCGACACGG GCGAGCTG^ cgccggagc g 
15 ATCATGAGCC TGGCCGACGC GCTGGACACU TQ xTCTCGATCT 

TGCAGTTGAC GGCGCGTTCC JGATGCCGGT £ GG catcgTGAAG 

CGGGCCGCGG TACGGTGGTG ACGGGTCGTG TGAAGACGAC 

gtcggcgaag aaatcgaaat cgtcggtatc aa ccaa GGTCAAGCA g 

CTGCACGGGC GTTGAAATGT TCCGCAAO^ cqAAGCGTGA AGACGTGGAG 
20 GCGACAACGT TGGTATCCTG CTGCGCGGCA ^ ACACGCACTT 
33K8E 5S£S£ ||S5g Ogjgggg -TCACACGC 

« C GC T C SSS gSItStGA TGCCGGGCGA 750 
25 CAACGTGTCG ATCACGGTGA A 



30 



35 



2) INFORMATION FOR SEQ ID NO: 17 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

( D ) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(B) STRAIN: ATCC 33431 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
45 GGCGCTATCC TGGTTGTTGC TGCGACTGAT GGCCCAATGC CACAGACCCG 
TGAGCACATC CTGCTGGGTC GTCAGGTTGG OOTTOOQ^ GGAACTGGTA 
TCCTGAACAA ATGCGACATG °™TGAC^ c CGGGCGACGA 

GAAATGGAAG TTCGTGAACT J£TGTCCCAG TA£ GGCGAAGCAG 
TACTCCAATC GTTCGTGGTT CTGCTCTGAA QGA TTCTTACATC 

50 AGTGGGAAGC TAAAATCGTT GAGCTGGCTG GCTA^i ^^^0^ 
CCTGAGCCAG AGCGTGCTAT CGATAAGCCG TTCCTGCTC cGTGTAGAGC 
CGTATTCTCC ATCTCCGGCC GTGGTACCGT TGTi TATCAAAGAT 
GCGGTATCAT CAAAGTTGGT GAAGAAGTTG JJ*gO aactgCTGGA 
ACTGCGAAAT CTACCTGTAC CGGCGTTGAA ^ GGTATCAAAC 
55 CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT ^ CTCTATCAAG 
GTGAAGAAAT CGAACGTGGT CAGGTACTGG AAGACGAAGG 
CCACACACCA AGTTCGAATC TGAAGTGTAC Ai TTCTACTTCC 
CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCC CGTTGAGATG 

60 SSSSSS caaaatgSt? gttaccctga TCCACCCAAT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 



50 
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CGCGATGGAT GACGGTCTGC GTTTCGCAA 



5 2) INFORMATION FOR SEQ ID NO: 18 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

0 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cedecea neteri 

(B) STRAIN: ATCC 33855 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 



40 

2) INFORMATION FOR SEQ ID NO : 19 

( i ) SEQUENCE CHARACTERI STI CS • 
< A ) LENGTH: 82 7 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

^ ( i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cedecea lapagei 

(B) STRAIN: ATCC 33432 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 



35 



30 



25 




50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
824 




50 
100 
150 
200 



144 



CA 02283458 1999-09-28 



»- a • i 

CCCCAATCAT CCGTGGTTCT GCTCTGAAAG CGCTGG^GG CGAAGCAGAG 

TGGGAAGCTA AAATCGTTGA GCTGGCTGGC JTCCl^ ATCGAAGACG 

AGAACCAGTA CGTGCAATCG ACCTGCCGTT CCTGCIO TGTAGAGC GC 

T ATT CTC CAT CTCCGGCCGT GGTACCGTTG JKACCG^ TCAAAGATAC 

5 GGTATCGTTA AAGTGGGCGA AGAAGTAGAA ATCGT1 CTGCTGGACG 

TGCGAAATCT ACCTGTACTG GCGTCGAAAT faliv- tATCAAACGT 

AAGGCCGTGC TGGTGAGAAC GTTGGTGTTC CTATCAAGCC 

gaagaaatcg aacgtggtca ggttctggct aagcca^ gacgaaggcg 

GCACACCAAG TTCGAATCTG AAGTGTACAT CTACTTCCGT 

10 GCCGTCATAC TCCGTTCTTC AARGGCTACC GTCCACA^ TAGAGATGGT 
JgESSS SSKES TACCCTGATC CACCCAATCG 

c^atSacga cggtctgcgt ttcgcaa 



15 



2) INFORMATION FOR SEQ ID NO: 20 

( i ) SEQUENCE CHARACTERISTICS : 
on (A) LENGTH: 831 bases 

* ( B ) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia pneumoniae 

(B) STRAIN: CLW 029 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
GCGGAGCTAT CCTAGTCGTT TCAGCTACAG ACGGAGCTAT OCCACAAACT 
AAAGAACATA TCTTGCTAGC TCGCCAGGTT c QAACTTATTG 

TTTCTTGAAT AAAGTAGATA TGATCTCTCA AGAAGAI^ AGGCTACAAA 
ACCTTGTTGA GATGGAACTT AGTGAGCTTC JT^^™ QC 
GGATGCCCTA JTATCCGTGG JTCTGCTTTG AAAGCTC^ QfaMJ ^ 
AAATTATATC GAAAAAOTTC GAGAA^^ CTTTCTTAAT GCCTATCGAA 
TCCCTACACC AGAAAGAGAA ATTGATAAGC CTT1C1 jl ^^tcgA 
GACGTATTCT CAATCTCTGG JCGTCGTACT GTGGTT GGATTAGGAG 
GCGTGGAATC JTTAAAGTTT CTGATA^GT TCA ^ gMM3MCST 

AGACTAAAGA AACAATCGTT ACTWrfww TTACTCCTCA GAGGTATTGG 
CCTGAAGGTC GTGCAGGAGA AAACGTTGGT TTACTCCT 
AAAGAACGAT GTTGAAAGAG GTATGGTGG1 QAAAGAAGAA 

AGC 2^t? SJSSiS Sagacctc AGTTCTTCTT 

GGCGGACGTC ATAAGCCTTT "1^- TCTTCCTGAA GGAACTGAAA 

?™ SSSSSE ££££££ atgttgagct cattggaaca 

GTTGCTCTTG AAGAAGGAAT GAGATTTGCA A 

2) INFORMATION FOR SEQ ID NO: 21 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNES S : Doub 1 e 

(D) TOPOLOGY: Linear 
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827 



25 



55 



60 



145 



CA 02283458 1999-09-28 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia psittaci 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 

TGGAGCGATT CTCGTTGTTT CCGCTACTGA CGGTGCGATG CCTCAGACCA 



AAGAACATAT TCTTTTGGCG AGACAGGTTG GTGTTCCTTA CATCGTTGTT i nn 

10 TTCCTTAACA AAATCGATAT GATTTCTCAA GAaStGCTG i SS 

200 
250 
300 

15 ACGTATTCTC PATTTPT^t n«^™^ ^i^^f UTj L ' CGAAG 350 

400 
450 
500 

20 ATCTCACACA CAATTTai^ ^UCUTGAA 600 

650 

CGTACAACAG ATGTTACAGG ^S55GK £££££ gSSgSS? ?SJ 
GGTTATGCCA GGCGATAACG TTGAATTCGA AGTTCAATTA StSSSSJ 1™ 
TAGCTCTAGA AGAAGGTATG AGATTT ■!■ wiftTTA ATTAGCCCAG 800 

25 826 



CTTAGTTGAA ATGGAATTGT CCGAACTTCT E5£X£ SSSSSSS 

TATCCGTGGT tctgctttga aagccttaga aggtgatgca 
agctacgttg aaaaaattcg cgagttaatg caagcagtgg atgataacat 
ccctactcca gagcgtgaag ttgataagcc tttcttaatg cctattoSg 

ACGTATTCTC TATTTCTGGT CGTGGTACTG TGGTCACAGG ACOTmSbS 
CGTGGAATCG TTAAAGTGGG TGATAAAGTA CAGATTGTTG gSSESSS 
TACTAGAGAG ACAATTGTTA CCGGTGTGGA AATGTTCAGA AAAGAACTTC 
CAGAAGGTCA AGCAGGGGAA AACGTTGGTT TGCTCCTCAG AGGTATCGGT 
AAGAATGACG TTGAACGTGG TATGGTTATC TGC CAACCTA ATAGCOTgS 
ATCTCACACA CAATTTAAAG GTGCTGTCTA CATTCTACAA Aa!25£2£ 

S2J^ GTCA TAAACCTTT ^ tttaccggat acagacctca SJiSiSSS 



2) INFORMATION FOR SEQ ID NO: 22 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia trachomatis 
40 (B) STRAIN: LGV 12 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 

GGGGCTATTC TAGTAGTTTC TGCAACAGAC GGAGC TATGC err***™** 

45 ^§ss sssss sgssss 3 

SSS3SS 5S33S SSE 23sds IlEs 2oS 

CATACATAGA GAAAGTTCGA GAGCtStGC £SS£S§£ SSSfJS?^ 250 
50 £CTACTCCAG AAAGAGAAAT TGACAAGCCT ?otSK££ l°,°r> 
CGTGTTCTCT ATCTCCGGAC GAGGAACTGT AGTAACTGGA £gTA??g1S£ Inn 

gtggaattgt taaagtttcc gataaagttc agttggtcgg S?5ES^ 400 
actaaagaaa cgattgttac tggggttgaa atgttS??? ISJ™~ t 450 
agaaggtcgt gcaggagaga atgttggatt mSSSJ SJ™^ c 500 

55 AGAACGATGT GGAAAGAGGA ATGGTTGTT? G???SSa> ^^° GTA 550 

cctcatacac ggtttaagtg tcctctt-S? ^rSSiJJ J?™*™^ 600 

=s sssks ?EH §iP ? "°° 
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10 



15 



2) INFORMATION FOR SEQ ID NO : 23 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ^ GIN ^SS! : chzyseo 1? acteriu 0I1 neningosepticu nl 

(B) STRAIN: CDC B7681 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 

GAGAACACAT C CTACTTTOC CCAGAATTGT TAGAGCTTGT 150 

= lii* ssgsss ssssss sr. 
25 sssss fssgg ~s ««ss ssss rsS 

£££££ SKSS^SS TAGCAACTQG TAGMTCG&G 400 

iEE ibss s 

30 SS 5SSS5SS ««» «™- Jggmgr ». 
C^CGCTCT TAACGAGGGT CTTAGATTCG CGATC 835 



4 0 2) INFORMATION FOR SEQ ID NO: 24 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 
45 (c) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: . . o 

(A) ORGANISM: Citrobacter amalonatxcus 

(B) STRAIN: ATCC 25405 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 

55 CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

Stcagcacat CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

?SotSaCA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

SaSSISa" gttEgtgaac TTCTGTCTCA GTACGATTTC CCGGGCGACG 200 

so 5X5^23$ Sttcgtggt TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 
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GAGTGGGAAG CGAAAATCAT 
CCCGGAACCA GAGCGTGCGA 
ACGTATTCTC CATCTCCGGT 
CGCGGTATCA . TCAAAGTGGG 
5 GACTGCCAAG TCTACCTGTA 
ACGAAGGCCG TGCGGGTGAG 
CGTGAAGAAA TCGAACGTGG 
GCCGCACACC ATGTTCGAAT 
GCGGCCGTCA TACTCCGTTC 
10 CGTACAACTG ACGTGACTGG 
GGTAATGCCG GGCGACAACA 
TCGCGATGGA CGACGG 



CGAACTGGCC GGCTTCCTGG 
TTGACAAGCC GTTCCTGCTG 
CGTGGTACCG TTGTTACCGG 
CGAAGAAGTT GAAATCGTTG 
CTGGCGTTGA AATGTTCCGC 
AACGTTGGTG TTCTGCTGCG 
TCAGGTACTG GCTAAGCCGG 
CYGAAGTGTA CATCCTGTCC 
TTCAAAGGCT ACCGTCCGCA 
CACCATCGAA CTGCCGGAAG 
TCAAAATGGT TGTTACCCTG 



ATTCTTACAT 3 00 

CCGATCGAAG 350 

TCGTGTAGAA 40 0 

GTATCAAAGA 45 0 

AAACTGCTGG 500 

TGGTATCAAA 550 

GCWCCATCAA 600 

AAAGACGAAG 650 

GTTCTACTTC 700 

GCGTTGAGAT 750 

ATCCACCCGA 800 
816 



15 
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2) INFORMATION FOR SEQ ID NO : 25 



( i ) SEQUENCE CHARACTERISTICS : 
(A). LENGTH: 825 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ci trobacter braakii 
(BJ STRAIN: ATCC 43162 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 

CGCGATCCTG GTTGTTGCTG CAACTGACGG CCCGATGCCG CAGACTCGTP 
AGCACATCCT GCTGGGTCGY CAGGTAGGCG TTCCGTACAT StOGTgSS 
C TGAACAAAT GCGACATGGT TGATGACGAA GAGCTGCTGG SSoSSS 

35 £££££ SSSSSE JESS™ SSSt lS " 



CGCCGATCGT TCGTGGTTCT GCTCTGAAAG CGCTGGAAGG 
TGGGAAGCGA AAATCATCGA ACTGGCTGGC TTCCTGGATT SJIS^cJ 
GGAACCAGAG CGTGCGATTG ACAAGCCGTT CCTGCTGCCT ATC^^cS 
SSJSS? 11 CTCTGGTCQ T GGTACCGTTG TTACCGGTCG TgJS^g? 

CATCA aagttggtga agaagttgaa atcgttggta tca^ga?ac 

TGCTAAGTCT ACCTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGAPP 

aaggccgtgc tggtgagaac gttggtgttc ?gctgcgt5S ?I?SS^g? 
™^ TCG aacgtggtca ggtactggct aagccgggct ctatca?g2? 

n™S CAAG TTCGAA TCTG AAGTGTACAT TCTGTCCAAA gIcGAMG^G 

SSEKSS xss&ss stccccS?? ss 



50 
100 



200 
250 
300 
350 

40 3GTATCATCA AAHTT^T^ ^TAGAGCGC 400 

450 
500 
550 

45 ess™ tccgttc ^c sssssxss OTccGc^f c?a™S? Ill 

actactgacg tgactggtac catcgaactg ccggaaggcg ttgaga?S2? ?SX 

^^S GGGC GACAACATCA AAATGGTTGT TACCCtSt? S?JSJ?2g Inn 
CGATGGACGA CGGTCTGCGT TTCGC v^AUCuAATCG 800 

825 



50 



2) INFORMATION FOR SEQ ID NO : 2 6 

(i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: ]r 0ser i 

(A) ORGANISM: Catrobacter Jcosen 

(B) STRAIN: ATCC 27156 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA C^CCCGATG CCGCAGACCC 
GTGAGCACAT CCTGCTGGGT gGTCAGGTAG ^ c tGGAACTGGT 

10 TTCCTGAACA AATGCGACAT GGJTGATGAC ^ ccgggcGACG 

tgagatggaa gtgcgtgaac tgctgtctca ^ tqga AGGCGAM gct 

ACACGCCGAT CGTTCGTGGT TCTGCaciun CTGQ ATTC TTACAT 

gagtgggaag cgaaaatcat cgaactggct ^ctac gctg CCGATCGAAG 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC fa TCGTGTAGAG 
15 ACGTATTCTC CATCTCCGGT COTGGTACCG ^ Q GTAT CAAAGA 

CGCGGTATCA TCAAAGTGGG CGAAGAAt>ii aatqttccGC AAACTGCTGG 
GACTGCGAAG TCTACCTGTA CTGGCGTTGA AATGTT TGGTATCAAA 
ACGAAGGCCG TGCTGGTGAG AA^GGTG ^ G yxc C ATCAA 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG BU AAAGATGAAG 

- ggss sss™ iiiii sssss sssss 

SSS3SSS GGCGACSS SSSESS StTACCCTG ATCCACCCGA 
??g?gIggac GACGGTCTGC GTTTCGCAA 



25 



■»n (i) SEQUENCE CHARACTERISTICS : 

30 (A) LENGTH: 827 bases 

(B) TYPE: Nucleic aci 

(C) STRANDEDNESS: Double 



2) INFORMATION FOR SEQ ID NO: 27 
SEQ' 

(A) UDUUIU' • 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Doc 1 - ' 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

4Q ( B ) STRAIN: ATCC 51112 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 

COCGATCCTO — SSSSSSS SSSSSS 

45 AGCACATCCT GCTGGGTCGT ^TAGGCG TTC QG ^ CTGGTAGA 

CTGAACAAAT GCGACATGGT JGATGACGAA CQ GGCG ACGACA 

GATGGAAGTT CGTGAACTGC TGTCTCAG1A QQ CGACG CAGAG 

CGCCGATCGT TCGTGGTTCT J^TCTGAAAG ^ CTTACATCCC 
TGGGAAGCGA AAATCATCGA ^CTGGCAGGC igt ATCGAAGACG 

50 GGAACCAGAG CGTGCGATTG ^GCCGTT CCTQCTGC TGTAGAGCGC 
TATTCTCCAT CTCTGGTCGT GGTACCGTT^ ATCGTTGGTA TCAAAGAGAC 
GGTATCATCA AAGTGGGTGA JGAAGTTGAA JTCGT CTGCTGGACG 
TGCCAAGTCT ACCTGTACTG ^^^GAAAT ^ TATCAAACGT 
AAGGCCGTGC TGGTGAGAAC GTAGGTGTTC CCATCAAGCC 

55 GAAGAAATCG AACGTGGTCA J™CTGGCT ™CCew^ GACGAAGGCG 
RCACACTATG TTCGAATCTG AAJTGTACAT J™ CTACTTCCGT 
GCCGTCATAC TCCGTTCTTC ^g™C SgGAAGGTG TTGAGATGGT 

S£S SSS«c cccc^tco 

60 Satggacga CGGTCTGCGT ttcgcaa 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
829 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
827 
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2) INFORMATION FOR SEQ ID NO: 28 

(i ) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 797 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 <D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Citrobacter freundii 

(B) STRAIN: ATCC 8 090 



20 



25 



30 



35 



.40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 

££SSSS SS , CGTQAGCACA 50 

AAATGCGACA TGGTTGATGA CgSgIgS? GTTCG TGAAC 100 

AGTTCGTGAA CTTCTctSS 2£J5£5 SSSSSS? 5X5*™°* 150 

TCGTTCGTGG TTCTGCTCTG AAAGCGCTCr iSSS?^ GACACTCCGA 200 

GCGAAAATCA TCGAACTCGC SSSSSiS 250 

AGAGCGTGCG ATTGACAAGC CGTTCCTGCT TCCtSSS rS%SS*F C 300 

CCATCTCCGG TCGTGGTACC GTTGTTArrr SSSSSS* GACG TATTCT 350 

ATCAAAGTTG GTGAAGAAGT SKSSg? 2S322S G ^ GGTATC 400 

GTCTACCTGT ACTGGCGTTG AAATG?t??? ^StSSSK AGACTGCT AA 450 

gtgctggtga gaacgttggt gt?c?gSg? S?5S?I?S? ?5 ggaaggcc 5 oo 

ATCGAACGTG GTCAGGTACT GGCTAaSSc ACGTQ AAGAA 550 

ssss sggsE Sii SH lEP g; 



2) INFORMATION FOR SEQ ID NO: 29 

( i ) SEQUENCE CHARACTER I S T I CS ■ 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 
AC c > STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE- 
50 (A) ORGANISM: Citrobacter sedlakii 

(B) STRAIN: ATCC 51115 



55 



60 



750 
797 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 

SSSSSS C C SS gggSSS o^S^ CCG ^ccc 

TTCCTGAACA AATGCGACAT CATCATCGTG lo8 

AGAGATGGAA GTTCGTGAAC TGCTGTPTna GAAGA * CTGC TGGAACTGGT 15 0 

ACACGCCGAT CGTTCGTGG? JESSES? CCGGGCG ACG 200 

GAGTGGGAAG CGAAAATCAT SSSSSSf SSgSSSS JggiSS ?o°o 
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ssss EES =11 1IH 

CGCGGTATCA TCAAAGTGGG CGAAGAAGT1 AAACTGCTGG 
GACTGCGAAG TCTACCTGTA CTGGCGTTGA AAgTT^ TGGTATCAAA 
ACGAAGGCCG TGCGGGTGAG ^GTAGGTG JTCTGC GCACCAXCAA 
CGTGAAGAAA TCGAACGTGG TCAGGTACTG wuj aaaqaTGAAG 
GCCGCACACC AAGTTCGAAT CTGAAGTGTA ™TTC^ GTTCTACTTC 
GCGGCCGTCA TACTCCGTTC CTGCCGGAAG GCGTAGAGAT 

SS£«K ?a™cc CT a 



TCGCGATGGA CGACGGTCTG CGTTTC 



20 



15 2) INFORMATION FOR SEQ ID NO: 30 

( i ) SEQUENCE CHARACTER I STI CS : 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 5 (vi) ORIGINAL SOURCE: . . 
^ (A) ORGANISM: CitroJbacter werkmann 

(B) STRAIN: ATCC 51114 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 

3 0 ^^>pp^.r rrczATGCCGC AGACTCGTGA 50 

100 
150 
200 

35 rCCGATCGTT C^i^ % rMATTC TTACATCCCG 300 

350 
400 
450 
500 

40 GCTAAGTCTA WSM^s** ?^?nv^ ATCAAACGTG 550 

600 
650 
700 

45 CTACTGACGT GACUWr*^ ACCCGATCGC 8o2 

823 



rrCATCCTGG TTGTTGCTGC GACTGACGGC CCGATGCCGC AGACTCGTGA 
^nl^rrTC CTGGGTCGTC AGGTAGGCGT TCCGTACATC ATCGTGTTCC 
SSSSS?? cgaStggtt GATGACGAAG AGCTGCTGGA ACTGGTAGAA 
g?SS?St gtctcagtac gatttcccgg gcgacgacac 

SSS ?CCGG?C?TO GTACCGTTGT TACCGGTCGT GTAGAGCGCG 

SmoSSa agttggtgaa gaagttgaaa tcgttggtat caaagacacc 

™;7^r~n2 rrTGTACCGG CGTTGAAATG TTCCGCAAAC TGCTGGACGA 

SS££a SgS Sogtgttct gctgcgtggt atcaaacgtg 
££££££ iKSSX S?g™ c?g?ccaa!g SgSg 

rcSS^S SxStcSS AAGGCTACCG TCCGCAGTTC TACTTCCGTA 
n^wlnrr OACTGGTACC ATCGAACTGC CGGAAGGCGT AGAGATGGTA 
SScSg^CG AATGGTTGTT ACYCTGATCC ACCCGATCGC 

gatggacgac ggtctgcgtt tcg 



50 

2 ) INFORMATION FOR SEQ ID NO: 31 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 82 6 bases 
55 (b) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

60 



151 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ci trobacter youngae 

(B) STRAIN: ATCC 29935 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 

GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG 
TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 



50 

TCCTGAACAA ATGCGACATG GTOgIt'ga'cG AAGAG^GCT SSaSSSI 5" 2 2 

10 gaaatggaag ttcgtgaact tctgtctcag tacgatttcc coqgSS?? ™2 

Tftmrrr.aTr ottp^p^ ^m^™ — ZVT . ____ ii< - c CGGGCGACGA 200 

250 
300 
350 

15 GCGGTATCAT CAAAGTTGGT SUSSSSS AA^TCGTTOG 53SSSSSSS 222 
ACTGCCAAttT rTanrTP.TAP nvrw^mm^- - V22±il_li_^ <~ AAA GAG 450 

500 

550 

600 

20 CGGCCGTCAT RrWRTT^ vn\\~^^ "V^l'-l? aau A UG AA G G 650 

700 
750 
800 
826 



TACGCCGATC GTTCGTGGTT CTGCTCTGAA aScGCTGGAA SSSSSsS 
AGTGGGAAGC GAAAATCATC GAACTGGCTG gSSSSS SSSJJSS? 
C CGGAAC CAG AACGTGCTAT CGATAAGCCG TTCCTGCTGC CAaSSSS 
CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTAc5gg5 
GCGGTATCAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAG 
ACTGCCAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTCcI^ 
n^ GCCGr GCTGG TGAGA ACGTTGGTGT TCTGCTGCGT S33SS25 
GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CTCTATCAAG 
CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGr 
SZS2S£ T ACTCCGT TCT TCAAAGGCTA CCGTCCGCAG tJSSJJSS 
™^ ACTGA CGTG ACGGGT ACCATCGAAC TGCCGGAAGG CGTAGAGAtS 
GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TcSccSI? Inn 
CGCGATGGAT GACGGTCTGC GTTTCG ^^ALCCTGA TCCACCCAAT 800 



25 

2 ) INFORMATION FOR SEQ ID NO: 32 

( i ) SEQUENCE CHARACTER I S T I CS : 
30 (A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 ( i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium perfringens 

(B) STRAIN: ATCC 13124 

4 0 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 

CGGAGCTATA TTAGTTTGTT CAGCAGCTGA TGGTCCAATG crTraaanan 

« ^ GAGCACAT CTT ATTATCA TCAAGAGTTG GAGTTGACCA Sc^SS , 

45 TTCTTAAACA AAGCAGATAT GGTTGACGAC GAAGAATTAT SSSSSSJ nSn 

TGAAATGGAA GTTAGAGAGT TATTAAGCGA GTACAACTTC «SsmS5 J«2 

AYATTCCAGT AATCAARGGA TCAGCTTTAG TAGCATtIga fSSS??S 2 °° 

aaPTrsspss GAGTTCTACA TGTAGGAGA? SJSSSE ?£Sg£JS tin 
AACTGAAGAA AGAAGAAAAA CTGTTGTAAr anna^nnAA tr^ZZ 450 
AGTTATTAGA TGAAGCACAA SSSIgmA aSIS?^ 500 

ss SS^SS S55SS sElillS 111 
= 2SSSSS. SSSES S I™ "° 
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2) INFORMATION FOR SEQ ID NO: 3 3 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Comamonas acidovorans 

15 (B) STRAIN: ATCC 15668 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 

CGGCGCCATC CTGGTGTGCT CGGCCGCTGA CGGCCCCATG CCCCAGACCC 
20 SSagScat CCTGCTGGCC CGTCAGGTGG GCGTGCCCTA catcatcgtg 

ttSctgaaca agtgcgacat ggtggacgac gaagagctgc tggaactggt 

G?5cGCGAGC TGCTTGCCAA GTACGACTTC CCCGGCGACG 
aSototm Stccgcggc TCGGCCAAGC TGGCCCTGGA AGGCGACCAG 
?tcSacaagg GCGAACCTGC CATCCTGCGC CTGGCTGAAG CACTGGACTC 
« SSStcccc ACGCCCGAGC GCGCTGTGGA CGGCGCCTTT GCAATGCCCG 
?ggSgacS CTTCTCGATC TCTGGCCGTG GCACCGTGGT GACTGGCCGT 
I??™CGCG GCATCATCAA GGTCGGCGAA GAAATCGAAA TCGTCGGTAT 
SSagacca TCGTCACCGG CGTGGAAATG TTCCGCAAGC 
?S?5SacS aggtcaagct GGCGACAACG TGGGTCTGCT GCTGCGCGGC 
30 I?SSgcgtg AAGACGTGGA ACGCGGCCAA GTGCTGTGCA AGCCCGGCTC 

cSS^aScCC CACACCCACT TCACGGCTGA GGTGTACGTG CTGTCCAAGG 650 
ACGAAGGTGG TCGCCACACT CCGTTCTTCA ACAACTACCG TCCCCAGTTC 
JatttcSgta CGACCGACGT GACCGGCTCC ATCGAGCTGC ccgccgacaa 
gSS?Sotg atgcctggcg acaacgtgtc gatcaccgtc aagctgatcg 
3 5 cccccatcgc catggaagaa gg 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 



700 
750 
800 
822 



2) INFORMATION FOR SEQ ID NO: 34 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 702 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
4 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (a) ORGANISM: CoryneJbacterium bovis 

(B) STRAIN: ATCC 7715 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 

cc GCCGCAGACC CGTGAGCACG TCCTCCTGGC CCGTCAGGTC GGTGTGCCCT 50 

ACATCCTCGT CGCCCTCAAC AAGTGCGACA TGGTCGACGA CGAGGACCTC 100 

ATCGAGCTCG TCGAGATGGA GGTC CGTGAG CTCCTCGCCG AGCAGGACTA 150 

CGACGAGGAC GCCCCGATCA TCCACATCTC CGCCCTCAAG GCCCTCGAGG 200 

GTGACC CGGA GTGGACGCAG CGCATCGTCG ACCTCATGAA GGCCTGCGAC 250 

60 GACGCCATCC CGGATCCGGA GCGCGAGACG GACAAGCCGT TCCTCATGCC 3 00 

153 
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GATCGAGGAC ATCTTCACGA 
GTGTCGAGCG TGGCATCCTC 
ATCTGCGAGA ACTCCCAGAA 
CAAGTTCCTC GACACGGCCG • 
5 GTGGCCTGAA GCGCGAGGAC 
GGCGCCTACA CGCCGCACAC 
CAAGGACGAG GGTGGCCGCC 
AGTTCTACTT CCGGACGACC 
GG 



TCACCGGCCG CGGCACCGTC 
AACGTCAACG AGGAGGTCGA 
GACGACCGTC ACCTCCATCG 
AGGCCGGCGA CAACGCCGCC 
GTCGAGCGTG GCCAGATCGT 
CGAGTTCGAG GGCTCCGTGT 
ACACGCCGTT CTTCGACAAC 
GACGTCACCG GCGTCGTCAA 



GTCACGGGCC 3 50 

GATCCTGGGT 4 00 

AGATGTTCAA 450 

CTGCTGCTCC 50 0 

GGCCAAGCCG 550 

ACATCCTCTC 600 

TACCGTCCGC 650 

GCTGCCGGAG 700 
702 



2 ) INFORMATION FOR SEQ ID NO : 35 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 689 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



45 



55 



60 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Corynebacterium cervicis 
25 (B) STRAIN: NCTC 10604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 

*n CGCGAGCACG TTCTGCTTGC TCGCCAGGTT GGCGTTCCGA 50 

30 £^£ CCTGGT TQ CCCTCAAC AAGGCCGATA TGGTCGACGA TGAGGAAATG inn 

CTGGAGCTCG TTGAGGAAGA GTGCCGCGAC CTGCTCGAGT COCtoSS?? tSn 

CGATCGTGAC GCCCCGATCA TCCAGGTTTC CGCGCTGAAG GCTCt2gSg Inn 

GTGATCCGCA GTGGGTTGCT AAGGTCGAGG AGCTCATGGA GGCAGTc5J? oln 

« ^ TCGTGC CGA CTCCTGA GCGCGACATG GACAAGCCGT SSSStoc 300 

3 5 GATCGAAGAC GTCTTCACCA TCACCGGCCG TGGCACCGTT GTTACCGPTP 11 n 

GTGTTGAGCG TGGCAAGCTC CCGATCAACT CTGAGGtJgA Inn 

ATCCGCGAAC CGCAGAAGAC CACCGTTACC GGTATCGaS TC^SaSJ 4S0 

GTCCATGGAT GAAGCATGGG CAGGCGAGAA CTGTGGTCtC CTC CTGCGTG tnn 

An ^ CCAAGCG CGAT ^GTT GAGCGCGGTC AGGTCGTTGC CCTTCcSS? XX 
40 TCGATCACCC CGCACACCAA CTTCACCGGA CAGGTCTACA TCCTCaSS 

GGAAGAAGGC GGTCGTCACA ACCCGTTCTT CTCGAACTA^ SSSS £n 

TCTACTTCCG CACCACGGAC GTGACCGGCG TCATCACCC CCGCAGT 65 0 



2) INFORMATION FOR SEQ ID NO: 3 6 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 804 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium flavescens 

(B) STRAIN: ATCC 10340 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 36 

154 



689 
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GGTTGTTGCT GCAACCGATG GTCCTATGCC GCAGACCCGC GAGCACGTTC 
TTCTGGCTCG CCAGGTTGGC GTTCCTTACA TCCTCGTTGC TCTTAACAAG 
SSSStSI TTGATGATGA GGAAATCATC GAGCTCGTTG AGATGGAAAT 
J^SStc ctcgctgagc AGGACTACGA CGAGGATGCC cccatcatcc 
SXScCGC SSSJScT CTTGAGGGTG ACGAGAAGTG GGTACAGGCC 

a5cg5Sac? tStgcaggc ctgcgatgac tccattccgg atccggagcg 

rJSarCGAC AAGCCCTTCC TCATGCCTAT CGAGGACAT C TTCACCATCA 
S^SSgG TACCGTTGTT ACCGGCCGTG TTGAGCGTGG CGTTTTGAAG 
SSSrSK ATGTTGAGAT CATCGGCATC AAGGAGAAGT CCATCTCCAC 

SESSScc ogStcgXa TGTTCCGCAA gatgatggac tacaccgagg 
SSSSoS ctgtggtctg cttctgcgtg gtaccaagcg tgaagaggtc 
pISgcggcc aggttgttat caagccgggc gcctacaccc cccacaccaa 
SttcSSoot tccgtctacg tcctcaagaa ggaagagggc ggccgccaca 

SiiSSSSi GGACAACTAC CGTCCGCAGT TCTACTTCCG TACCACTGAC 

SSSSSS SottScct gcctgagggc accgagatgg tcatgcctgg 
SSSSmSt gatatgaccg ttgagctcat ccagcccgtc gctaggatga 



20 



35 



GGGC 



2) INFORMATION FOR SEQ ID NO: 37 

(i) SEQUENCE CHARACTERISTICS : 
25 (A) LENGTH: 692 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: CoryneJbacteraum kutschen 

(B) STRAIN: ATCC 15677 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 

TGCCTCAGAC CCGTGAGCAC GTTCTTCTTG CTCGCCAGGT TGGCGTTCCT 
SStcctcg TTGCTCTTAA CAAGTGCGAC ATGGTTGACG ATGAGGAAAT 
StcGAGCTC GTTGAGATGG AAGTTCGCGA GCTTCTTGCT GAGCAGGAGT 
acgatcSga GGCTCCAATC ATCCACATCT CTGCTTTGAA GGCTCTTGAG 
OGCGACGAGA AGTGGACTCA GGCCATCATC GACCTCATGC AGGCTTOTGA 
SSStcCATC C CAGATC C AG AGCGTGAGAC CGACAAGCCA TTCCTCATGC 

c?a5~ tScttcacc atcaccggtc gtggcaccgt tgttaccggt 



50 
100 

40 CATCGAGCTC «™^x^ — — — — ^TT^A GGCTCTTGAG 2oS 

250 
300 

,= StoSSSKS gSgGctSS S^JSmJ GAGGACGTCG AGATCATCGG 400 

550 



SS^??S? TGATTACACC GAAGCTGGCG ATAACTGTGG TCTGCTTCTT 
Stmtatca AGCGCGAAGA CGTTGAGCGT GGTCAGGTTG TTGTTAAGCC 
SrSSSc ACACCTCACA CCGAGTTCGA GGGCTCTGTT TACGTTCTTT 600 
50 cS^aSS SS?GGCCGC CACACCCCAT TCTTCGACAA CTACCGTCCA 650 
S££tc£act TCCGCACCAC TGACGTTACC GGTGTTGTGA AG 



650 
692 



55 2) INFORMATION FOR SEQ ID NO: 38 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7 97 bases 

(B) TYPE: Nucleic acid 
60 <c) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebac terium minutissimum 

(B) STRAIN: ATCC 23348 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 

CCTGGTTGTT GCTGCAACCG ATGGCCCGAT GCCGCAGACC CGCGAGCACG 
TTCTTCTGGC CCGCCAGGTT GGCGTTCCGT ACATCCTCGT TgSotSS 
AAGTGTGACA TGGTTGACGA TGAGGAAATC ATCGAGCTCG TtSSatSSa 



TCCATCGTTG ACCTGATGCA GGC^GcZl? GACTCCATCC SSKSSS 
GCGCGAGCTG GACAAGCCGT TCCTGATGCC GATCGAGGAC ATCT^S^S 

SSS525 2™«5E KSSS 



50 
100 

GATC CGTGAG CTGCTCGCTG AGCAGGACTA CGACGAGGAA 5SSS3SS 200 
15 TTCACATCTC CGCTCTGAAG GCTCTTGAGG G CGACGAGAA gSSSSS ??S 

300 

aacgttaacg aggacatcga gatcatcggt atcaag^aS ag^ccSS? til 

20 CACCACCGTT ACCGGTATCG AGATGTTCCG CAAGATGATG GACTACACCG inn 

AGGCTGGCGA CAACTGTGGT CTGCTTCTGC GTGGTACCAA GCGTGaJSS «X 

GTTGAGCGTG GCCAGGTTTG CATCAAGCCG GGCGCTTACA Cc2SctSc inn 

CAAGTTCGAG GGTTCCGTCT ACGTCCTGAA GAAGGAAGAG ScSSgSS «X 
« ACACCCCGTT CATGGACAAC TACCGTCCGC AGTTCTACTT OCGC^SSc 

25 GACGTCACCG GTGTCATCAA GCTGCCGGAG GGCACCGAGA TgStCaS^C 7^ 

GGGCGACAAC GTTGAGATGT CCGTAGAGCT GATCCAGCCG GTCGCTA 79? 



3 0 2) INFORMATION FOR SEQ ID NO: 3 9 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 702 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium mycetoides 

(B) STRAIN: ATCC 21134 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 

^GCAGACC CGCGAGCACG TTCTTCTGGC CCGCCAGGTC GGCGTCCCCT 
ACATCCTCGT TGCGCTGAAC AAGTGCGACA TGGTTGATGA SgE£££S 
ATCGAGCTCG TGGAGATGGA GGTCCGTGAG CTGCTCGGCG AGCAGGAOT? 
CGACGAGGAC GCCCCCATCA TCCACATCTC CGCTCTGAAG GCTCTC^r 
GCGACGAGAA GTGGGTTCAG TCCGTGCTCG ACCtStGCA tcSSSSS 

GAC I CCATCC cggatccggt ccgcgagacc gaccgcgact tcctSSS? 

SJS^AGGAC ATCTTCACCA TCTCCGGCCG CGGCACCGTG OTAc££S£ 

°tgtggagcg cggcgtgctc aacctcaacg acgaggtcga gatca?2S£ 
at ^cgaca agtcccagaa gaccaccgtc acctccatcg ag5?g??S 
^°^gctc gataccgctg aggcaggcga caacgcggct ctgctoSS* 
SS^ctgaa gcgcgaggac gtcgagcgtg gccaggttgt catSagS? 

5?^CCTACA CCCCGCACAC CAAGTTCGAG GGTTCCGTCT SgJotSS? 
^™ CGAG GGCG GCCGCC ACACCCCGTT CTTCGACAAC TACCgSSgp 
AGTTCTACTT CCGCACCACC GACGTGACCG GTGTTGTGAA SSSSSE 
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2 ) INFORMATION FOR SEQ ID NO: 40 

D (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 674 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) 0^ GIN ^ G j^? : Corynebacteriu* pseudogeni taliu* 
(B) STRAIN: ATCC 33038 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 
„„„~r.~ wnrrrrTCC TTACATCCTC GTTGCGCTGA ACAAGTG CGA 

2SS™S SatgaggSa SmSagct cgttgagatg gagatccgtg 
S^gScgc agagcaggat tacgatgagg aagctcctat cgttcacatc 
iSSSSS aggccctcga gggcgatgac aagtgggtac aotccgtcgt 
^p??SrATG gaagcctgcg acaactccat cccggatccg gagcgcgcta 
JSaccaS SEScSSS cctatcgagg acatcttcac cattaccggc 
n^r^rrr ttgttaccgg ccgtgttgag cgtggccgtc tgaacgtcaa 
SSS?I?r?? gagatcatcg ctatccagga gaagtcccag accaccaccg 
SSSSSSS cgcaagatga tggactacac cgaggctggc 

rACAACTGTG GTCTGCTTCT GCGTGGTACC AAGCGTGAGG ACGTTGAGCG 
%^^*rrTT GTTATCAAGC CGGGCGCTTA CACCCCGCAC ACCAAGTTCG 

ISSSS? 2ScgSctg AAGAAGGAAG AGGGCGGCCG ccacaccccg 
t??aSaaS Staccgtcc GCAGTTCTAC TTCCGTACCA CGGACGTTAC 
CGGTGTTGTT CACCTGCCAG AGGG 



20 GCTCGCCAGU L™^i, X ^ ^ZZXnn* rr.TTr,.r,ATn 3AGATCCGTG 100 

150 
200 
250 
300 

25 CCGACCAG c iTJ ^ _ : - wnJ^ra-vr TfiAACGTCAA 350 

400 
450 
500 
550 

30 TGGCCAGGTU ^^^ z ^^ Z ^ ; I ^Jzraarra TACACCCCG 600 

650 
674 

CGGTGTTGTT CACCTGUUA^ *sj>^ 
35 



2) INFORMATION FOR SEQ ID NO: 41 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 694 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: t 

(A) ORGANISM: CoryneJbacterauzn renale 

(B) STRAIN: ATCC 19412 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 

TGCCTCAGAC CCGTGAGCAC GTTCTGCTTG CTCGTCAGGT CGGCGTTCCT 50 

ScATCCTCG TTGCACTGAA CAAGTGCGAC ATGGTCGACG ACGAAGAAAT 100 

^ Ja^gSgCTC GTCGAGATGG AAATCCGTGA ACTGCTCGCA GAGCAGGACT 150 

SSSSagoa Sgctcctatc GTTCACATCT CCGCTCTGGG CGCCCTGAAC 200 

SSSScAQA AGTGGGTTGA CTCCATCGTC GAACTGATGG AAGCTTGCGA 250 

SSS2S?C cSgACCCAG TTCGCGACAT CGACCACCCA TTCCTGATGC 300 

™^P^ CATCTTCACC ATTAC CGGTC GCGGTACCGT TGTTACCGGC 3 50 

60 CGTGTCGAGC SLgTCAAC GAAGAAGTTG AGATCATCGG 400 

157 
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TATCAAGGAC AAGTCCCAGA 
GCAAGATGCT GGACTACACC 
CGCGGCATCG GCCGTGAGGA 
AGGCGCTTAC ACCCCTCACT 
5 C CAAGGACGA GGGTGGCCGC 
CAGTTCTACT TCCGCACCAC 



AGACCACCGT CACCGGTATC 
GAAGCTGGCG ACAACTGTGG 
TGTCGAGCGT GGCCAGGTTA 
CTGAGTTCGA GGGCTCTGTC 
CACACCCCAT TCTTCGACAA 
CGACGTGACC GGCGTTGTGC 



GAGATGTTCC 450 

TCTGCTGCTC 50 0 

TCATCAAGCC 550 

TACGTCCTGT 600 

CTACCGTCCA 650 

ACCT 694 



10 2 ) INFORMATION FOR SEQ ID NO: 42 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 687 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium ulcerans 

(B) STRAIN: NCTC 8665 



25 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 

GCCGCAGACC CGCGAGCACG TTCTGCTGGC TCGCCAGGTT GGCGTTCCKT 
ACATCCTSGT TGCACTGAAC AAGTGCGACA TGGTTGACGA TGAGGARCTC 
CTSGAGCTCG TCGAGATGGA GGTCCGCGAG CTGCTGGCTG AGCAGGACTA 



50 
100 

CGACGAGGAA GCTCCGRTCG iJSSSSE CGCWCtSaC 3S5SS55S To 
3 0 GCGACSAGAA GTGGGCTVAC TCCATCCTCG AGCTGATGCA GGCTtScgIS l%n 

300 
350 
400 
450 
500 
550 



GAGTCCATCC CGGATCCGGA GCGCGAGACC GACAAGCCGT TCCTGATGCC 
GATTGAGGAC ATCTTCACCA TTACCGGTCG CGGYACCGTT GTTACCGGCC 
GTGTTGAGCG TGGCDTCCTG AACGT SAACG ACGASGTTGA GATCATGGGY 
ATCCGGGAGA AGTCCCAGAA GACCACCGTY ACCKSCATCG AG^SSSX 

3 5 CAAGMTGMTG GACWCCGCAG AGGCTGGCGA CAACGCTGSW CTGCTGCTG^ 
GTGGTMTSAA GCGTGAGGAC GTTGAGCGTG GCCAGATCAT CGYTAAGCCG 
GGCGCKTACA CCCCGCACAC CGAGTTCGAG GGCTCCGTCT ACGTCCTgSS 
CAAGGACGAG GGCGGCCGCC ACACCCCGTT CTTCGACAAC TACCGtIcIc tin 
AGTTCTACTT CCGCACCACC GACGTSACCG GTGTTGT TACCGTCCGC 650 

40 00/ 

2) INFORMATION FOR SEQ ID NO: 43 

45 (i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 778 baBes 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium urealyticum 
55 (B) STRAIN: ATCC 43 042 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 

CTGGTTGTTG CTGCAAC CGA TGGCCCGATG CCGCAGACrr rTnapnn^ 
60 TCTGCTGGCT CGCCAGGTTG GCGTTCCGTA S5££^ ^SS^ JS 

158 
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10 



15 



2) INFORMATION FOR SEQ ID NO: 44 

9 0 (i) SEQUENCE CHARACTERISTICS: 

20 (A) LENGTH: 703 bases 

(B) TYPE: Nucleic acid 



25 



30 



35 



40 



45 



f B) Tira: i.^-^ - 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

<Vil 'ST'SaKE : cory»eba=«riu„ xerosis 
(B) STRAIN: ATCC 373 



50 



2) INFORMATION FOR SEQ ID NO: 45 

(i) SEQUENCE CHARACTERISTICS: 
cc (A) LENGTH: B32 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 ( i i ) MOLECULE TYPE: Genomic DNA 

159 



150 
200 
250 
300 



AGTGCGACAT GGTTGACGAT ^GAGCTCC TCGAGCTCGT CGAGATGGAG 
GTCCGCGAGC TTCTGGCTGA GCAGGACTAC GACGAG XGGGTCGACT 
CCCGATCTCC GCACTGGGCG CCCTGGACGG CGATCAGAAG ggacccggag 

CCATCCTCGA GCTCATGAAG GCTTGCGACG TCTTCACCAT 350 

CGCGAGACCG ACAAGCCGTT CCTGATGCCG ^TGAGGA GGCGTCCTGA 400 

TACCGGTCGC GGCACCGTCG TTACCGGCCG JCTTGA^L GTCCACCAAG 450 

ACCTGAACGA CGAGGTCGAG ATCCTGGGCA JCOOCa*^ ACACCGCAGA 500 

ACCACCGTCA CCTCCATCGA GATGTTCAAC AggTOL^ CGTGAGGACG 550 

GGCTGGCGAC AACGCTGCAC TGCTGCTGCG Q CCCGCACACC 600 

TCGAGCGAGG CCAGATCATC ^TAAGCCGG GJOCT^C ^ 

GAGTTCGAGG GCTCCGTCTA CGTCCTGTCC ™gj CGTACCACCG 700 

SSSSSS ™GAS gScCGACAT GGTCATGCCG 750 
SSSSSS TTGAGATGAG CGTCAAGC 



50 
100 
150 
200 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 
CGCAGACCCG TGAGCACGTC CTCCTGGCCC GCCAGGTCGG CGTCCCCTAC 
ATCCTCGTCG CCCTGAACAA GTGCGACATG ££££ CAGGAC TACG 

CGAGCTCGTG GAGATGGAGG TGCGTGAGCT JCTCGCCGA qctCAATGGC 
ACGAGGAGGC CCCGATCGTG CACMCTCCG gCCTGG^ CCGTCGACGA 250 

gaagagaagt gggtcgactc catcgtcgag ^atgaacg tgcccg 3 0 o 

GAACGTTCCG GACCCGGTCC GCGAGACCGA CACCGGTCGC 350 

TCGAGGACAT CTTCACCATC ACCGGCCGCG JgJgCfalwj. TCCTGGGCAT 
GTGGAGCGCG GCACCCTGAA GfTCAACGAC ^GTCGAGA ^ 
CCAGGAGAAG TCCCAGACCA CCACCGTCAC CGGCA1C GCTGCTCCGC 
AGCTGCTGGA CTCCGCCGAG GCCGGCGACA ACTfa^ CGAAGCCGGG 
GGCATCAAGC TCGAGGACAT CGAGCGCGGC ggMO^ ATCCTGGCCA 
CGCCTACACC CCGCACACCG AGli^^o ^ CTA CCG TCCGCAG 

SSSSS SSS £S££«IC TGCCGG^CG 



400 
450 
500 
550 
600 
650 
700 
703 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: ' Coxiella burnetii 

(B) STRAIN : Nine Mile phase II 



10 



15 



20 



25 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO- 45 

===== * 

333SS sssss SEE ggg gas gs 

GTGAGGTTGG GGAGCCATCG ATAATCAAAT na^^^ 250 

TACTTCCCGC AGCCGGAGCG AGCGATATAP GATGGACACG 3 00 

CGAAGATGTG TTTTCGATAT OoSSSSSS ^SS GTTTT TAATGCCGAT 350 

TAGAGCGAGG GATCATCAAA SSSSS?" 3 ACGGGACGCG 400 

AAGGACACGA CGAAGACGAC* SSSK£S ^ GGGGATC «0 

ATTGGATGAA GGTCAAGCGG StgAcS?g5 SSSSJJSJ TTCGCAAATT 500 

CGAAACGCGA AGAAGTGGAG CGTGGTcSg T^rrJ^ TTGAGAGGGA 550 

ATCACGCCAC ACAAGAAATT TGArrrfiPT? ^^?2£ GAA ACCGGGATCG 600 

AGAAGGGGGA CGclScACAC SiSS?? J™? GTGT TGTCGAAGGA 650 

ATTTCCGCAC GACGGACotS SSSSS iSSSSS CCGCA ™T 700 

ATAGAGATGG TGATGCCGGG AGmSSStc ACCGGA GGGG 750 

tgcgccggta gcgatggatg ^ gtgacgg ttgaattgat 800 

Q *> O 



2) INFORMATION FOR SEQ ID NO: 4 6 

30 (i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

C STRANDEDNESS : Double 
(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE ■ 

40 £! e^ ANISM: Aoahiaae 

40 (B) STRAIN: ATCC 33379 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46 



45 



50 



55 



60 



50 



SS? SSSEE 3SSSS2S ^; CCGATGC gggagag ccg 

TCCTGAACAA GTGCGACATG SSSSS SSSSSS ATCATCG TGT 

GAGATGGAAG TTCGCGAACT GcStAtJS ™^ CTGCT CGAACTGGTT i 50 

TACGCCGGTA ATCoSSSS SSSSSK CGGGCGAGG A 200 

AGTGGGAAGC GAAGATCATC GAACtSSS a?SS3S£ ^^GCCC 250 

ccggaacctg agcgtgacat SISSIES %^ ggctgga ct cctacatt 300 

CGTATTCTCA AlciSSS? £SES£ GGATCGAAG A 350 

gcggtatcat caaggtaggc gacgaaStt? SJJSSS^ cgtgt agagc 400 

ACCACCAAGA CTACCTGTAC TOGCM^SI? JSSS™ 0 T ATCAAGCCG 450 

cgaaggccgt gctggtgaga ISSSSJ 5S5T ccgca AACTGCTGGA 500 

GTGACGAAAT CGAACGTGGT oSSSSS ^ GCTGCGT GG TACCAAGC iso 

CCGCACACCA aS5?SE5? SSSSSH 00 CAGG ATCACT |qS 

CGGCCGTCAT ACTCCGTTCT TCA^gSSS iJSS^ AGGATGAAGG 650 

GTACCACTGA cgtgactggc acc£t!55S S™ gcgcag ttctacttcc 700 

GTAATGCCGG GCGACAACAT J^ISSSS TGCCGGAAGG CGTAGAGATG 750 

CGCCATGGAC GATGCT CAAGATGGTT GTTACCCTGA TCCACCCGAT 80S 



160 



816 
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2) INFORMATION FOR SEQ ID NO: 47 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Edwards i ell a tarda 

(B) STRAIN: ATCC 15947 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 

GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACCCG 
TGAGCACATC CTGTTGGGTC GCCAGGTAGG CGTTCCGTAC ATCATCGTGT 
TCCTGAACAA GTGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 
GAGATGGAAG TTCGCGAACT GCTGTCTCAG TACGACTTCC CGGGCGACGA 
CACGCCGGTA ATCCGCGGTT CTGCGCTGAA AGCGCTGGAA GGCGAAGCCG 
AGTGGGAAGC GAAGATCATC GAACTGGCTG AAACTCTGGA CTCCTACATC 
CCGGAACCTG AGCGTGACAT CGACAAGCCG TTCCTGCTGC CGATCGAAGA 
CGTATTCTCT ATCTCTGGCC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 
GCGGTATCAT CAAGGTAGGC GACGAAGTTG AAATCGTTGG TATCAAGCCG 
ACCACCAAGA CCACCTGTAC TGGCGTTGAA ATGTTC CGCA AACTGCTGGA 
CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTACTAAGC 
GTGACGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCACT 
CCGCACACCA AGTT CGAATC TGAAGTGTAC ATCCTGAGCA AGGATGAAGG 
CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 
GTACTACTGA CGTGACTGGT ACCATCGAAC TGCCGGAAGG CGTAGAGATG 
GTAATGCCGG GCGACAACAT CAAGATGGTT GTTACCCTGA TCCACCCGAT 
CGCCATGGAC GATGGTCTGC G 



2) INFORMATION FOR SEQ ID NO: 48 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Eikenella corrodens 

(B) STRAIN: ATCC 23834 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48 

CGGTGCCATC CTGGTGGTAT CCGCTGCTGA CGGCCCCATG CCTCAGACTC 
GCGAACACAT CCTGTTGGCT CGTCAGGTAG GTGTACCCTA CATCCTCGTA 
TTCATGAACA AATGCGACAT GGTAGATGAT GCCGAGCTGC TTGAGTTGGT 
TGAGATGGAA ATCCGCGACC TGCTCTCCAG CTATGACTTC CCTGGTGACG 
ACTGCCCGAT CGTACAAGGT TCCGCTCTCA AAGCCCTCGA AGGCGATGCC 

161 
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GGTTACAAAG AAAAAATCTT 
CCCCACTCCT CAACGTGCTG 
ACGTATTCTC TATCTCCGGC 
CGCGGCATCA TCAAAGTAGG 
5 CACT CAGAAA ACTACCTGTA 
ACGAAGG TCA GGCCGGTGAC 
CGTGAAGAAG TTGAGCGTGG 
TCCGCACACC AAGTTCAAAG 
GTGGTCGTCA CACCCCGTTC 
10 CGTACTACTG ACGTAACCGG 
GGTTATGCCT GGTGAGAACG 
TTGCTATGGA AGAAGGTCTG 



CGAACTAGCT GCTGCTTTGG 
TAGACAAACC CTTCCTGTTG 
CGTGGTACCG TAGTAACCGG 
TGAAGAGATC GAAATCGTTG 
CTGGCGTGGA AATGTTCCGC 
AACGTAGGCG TACTGCTGCG 
TCAAGTATTG GCTAAACCCG 
CCGAAGTATA CGTATTGAGC 
TTTGCCAACT ACCGTCCACA 
TGCTGTAGAG CTGGAGCCTG 
TAACCATCAC CGTAGAACTG 
CGCTTTGCGA 



ATAGCTACAT 300 

CCGATCGAAG 350 

TCGTGTAGAG 400 

GTCTGAAGCC 450 

AAACTGCTGG 500 

CGGTAC C AAA 550 

GCACCATCAC 600 

AAAGAAGAAG 650 

GTTCTACTTC 70 0 

GTGTAGAAAT 750 

ATTGCTCCGA 800 
830 



15 

2 ) INFORMATION FOR SEQ ID NO: 4 9 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 808 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: En terobacter aerogenes 

(B) STRAIN: ATCC 13048 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49 



25 



GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTPG 
TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 
TCCTGAACAA ATGCGACATG GTTGATGACG AAGAG CTGCT GgSctS?? 
35 n^™^ TTCGTG *ACT GCTGTCTCAG TACGATTTC? ^G^CgSS 
n™^ ATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGa2gSg 

agtgggaagc gaaaatcatc gaactggctg gcttcctgga ttcttacat? 

CCRGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCgSgA 
ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 
CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATCAaSaC 



50 
100 
150 
200 
250 
300 

40 GCGGTATCAT CAAACTT^t n-Z^n^o ^t^^i ^Y™ GC *00 

450 

500 

550 

600 

650 

700 

750 



ACCGCGAAAA CCACCTG^C TGGCGTTGAA A^TTCCGcI aKSSSSSS 
CGAAGGCCGT GCTGGTGAGA ACGTAGGYGT TCTGCTgEot G^IJSSaC* 
GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CAGCATCaJg 
CCGCACACCA AGTT CGAATC TGAAGTGTAC ATCCTGTCCA a5ga5gSSS 
45 CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG ^S^S 
GTACTACTGA CGTGACTGGT ACCATCGAAC TGCCGGAAGG JctIg^GRTC 
?GC^A?GG GG GCGACAACAT CAAAATGGTT GTTACCcISa SSSSS 

808 

50 

2) INFORMATION FOR SEQ ID NO: 50 

( i ) SEQUENCE CHARACTER I STICS • 
55 (A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 



162 
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(vi) ORIGINAL SOURCE: „„^^„ a 

(A) ORGANISM: Enterobacter agglomerans 

(B) STRAIN: ATCC 27 989 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50 
CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

^5??^ S??SS£? ?S5SS£ SScGCTOGA AGOCGSHGCW 250 
SS SSSS SKgJJCT q^GATCGAAG I 

■ NUB 1 

TCGCGATGGA CGACGGTCTG CGTTCGCA 

25 

2) INFORMATION FOR SEQ ID NO: 51 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Enterobacter amnigenuB 

40 (B) STRAIN: ATCC 33072 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 
TGGCGCGATC CTGGTTGTTG CTGCAACTGA TGGCCCTATG CCACAGACGC 50 

4 , otSSgcaSt cctgctgggt cgtcaggtag gcgttcctta catcatcgtg 100 

??S?gSa StGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

SSSSg^a gttcgtgaac ttctgtctca gtacgatttc ccaggtgatg 

iSSJSSAT CATCCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGAAGCA 
?S?S?iS5o CTAAAATCGT TGAGCTGGCT GGCTACCTGG ATTCTTACAT 
50 ?C?SS^ GAACCTGCTA TCGATAAGCC ATTCCTGCTG CCAATCGAAG 

SStattctc TATCTCCGGC CGTGGTACTG ttgtaaccgg tcgtgtagag 
cSSSSSS ttSaagttgg cgaagaagtt gaaatcgttg gtatcaaaga 

r^PTTPTAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 

SSSgtSg aacgttggtg ttctgctgcg tggtatcaaa 
„ cStgaSaaa tcgaacgtgg tcaggtactg gctaagccag gctcaatcaa 
SSgcacacc aaattcgaat ctgaagttta tattctgtcc aaagatgaag 
gSgSSS tactccgttc ttcaaaggct accgtccaca gttctacttc 

?S?2cScTG ACGTGACCGG C AC CAT CG AA CTGC CAGAAG GCGTAGAGAT 

SgcgacSca ttcagatggt tgttaccctg atccacccaa 

60 TCGCGATGGA TGACGGTCTG CGTTT 

163 



200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
825 



CA 02283458 1999-09-28 



5 



2) INFORMATION FOR SEQ ID NO: 52 



(i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi ) ORIGINAL SOURCE : 
15 < A > ORGANISM: Enterobacter asburiae 

(B) STRAIN: ATCC 35953 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 

™^? ATC CTGG TTGTTG CTGCGACTGA CGGCCCAATG CCTCAGACTC 
GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCTTT CATCATcSJr 
?™ GAACA AATGCGACAT GGTTGATGAC (3AAGAGCTGC SgS£5gg? 
AGAGATGGAA GTTCGTGAAC TGCTGTCTCA GTACGATTTC CCGgScScJ 
ATACTCCAAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGgSSgS 
GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTotSS 
CCCAGAACCA GAGCGTGCGA TTGACAAGCC ATTCCTGCTG SISSSS 
ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCOTC??™r 
CGCGGTATCA TCAAAGTTGG CGAAGAAGTT GAAATCgSg SScSS ct 
^^ TAAG TCT ACCTGTA CTGGCGTTGA AaIStTCCGC A^SgStS 
nn^l~ CCG TGCT <3GTGAG AACGTTGGTG TTCTGCTGCG TgSSSSS 
CGTGAAGAAA TCGAACGTGG TCAGGTTCTG GCGAAGCCAG GCTCAa5??J 
GCCACACACC AAGTTCGAAT CTGAAGTGTA CATCCTGTc2 A^G^SSg" 
GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCACA GTTCTACTTr 
CGTACAACTG ACCTC^rrr.n Ti^»^n„ ^^i.-.r^ OTTCTACTTC 



20 

50 
100 
150 

25 GAGTGGGAAG CGAAAATPAT ^^n^^E ^CGACGCA 25S 

300 
350 
400 

3C ACGAAGGCCG TnrTnnT^n ^t'f^l^ ^CTGCTGG 5oS 

550 
600 

3= ££?Ss s ' *»5«ss SwSSS ss ™ 

35 TCGCGATGGA CGACGOTCTG JJ A * GAT<3GT TGTGACTCTG ATcScCCAR Sgg 

822 

4 0 2) INFORMATION FOR SEQ ID NO: 53 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter cancerogenus 

(B) STRAIN: ATCC 35317 



55 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO • 53 

AGAAATGGAA GTTCGTGAAC TrPTPTPTr» o^vS^? GC TGGA ACTGGT 150 
ACACTCCAAT SSSSSSS ^*2£EEIF CCAGG CGACG 200 



ACACTCCAAT CGTTCGTGGT TC^GCGCTgS 2SS5S SSSSSS5S 

164 



250 
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GAGTGGGAAG CAA^TCAT OGAACTGGCT OOCTTCCTGG AT^TTACAT 300 

CCCAGAACCA GAGCGTGCGA TTGACAAGCC ATTCL.1 TCGTGTAGAG 400 

ACGTATTCTC CATCTCCGGT JGTGGTACCG TTGTTA^ GTATCAAAGA 450 

CGCGGTATCA TCAAAGTTGG gAAGAAGTT ^JAATCGT AAACTGCTGG 500 

5 TACTGCKAAA TCTACCTGTA CTGGCGTTGA TGGTATCAAA 550 

ACGAAGGCCG TGCTGGTGAG ^GTTGGTG JTCTGCT^ 600 

CGCGAAGAAA TCGAACGTGG JCAGGXTCTG OCO^BOCg ^ 650 

GCCACACACC AAGTTCGAAT CTGAAGTGTA ^LX.£L GTTCTACTTC 700 
GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCACA ^TA^ 

10 gggSSS SSSS2SSS SS TGTGACGCTG ATCCACCCAA 300 
TCGCGATGGA CGACGGTCTG CGTTTC 

2) INFORMATION FOR SEQ ID NO: 54 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: EnteroJbacter cloacae 

(B) STRAIN: ATCC 13047 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 

Wmmmm I 

ATCATCAAAG TGGGTGAAGA AGTTGAAATC GTTGGTATCA AAGAGACTGC 45 
JISicTACC TGTACTGGCG TTGAAATGTT CCGCAAACTG CTGGACGAAG 50 

~ SSSS ?? T ™§ SS SBSSS 

= IS sgsss ssesss sssss 

TGGACG 



15 



20 



25 



30 



50 

2) INFORMATION FOR SEQ ID NO: 55 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter gergoviae 

(B) STRAIN: ATCC 33028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 
CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 



GTGAGCACAT CCTGCTGGGT 5to55S5 £££££ SSSSSSSS ,22 
10 TTCCTGAACA AGTGCGACAT GGTTGATGAC GAAGAGCTGC TGgS?¥S? 

AGAGAT^ttAA r,TTrn.Tr.A An r^^m^m^™^, 1 wAACTGGT 150 

200 
250 
300 



AGAGATGGAA GTTCGTGAAC TGCTG^CA GTACGATTTC ££2SS£ 

ACAC . CCCGAT cgttcgcggt tctgcgctga aagcgctgga aggcgaSS 
gagtgggaag cgaaaatcat cgaactggct ggccacctgg atacctSS 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG c3£S£££ 

15 acgtattctc catttccggt cgtggtaccg ttgttaccgg tcgtSw? ll« 
cgcggtatca tcaaggttgg tgaagaagtt gaaatcgt?g Ill 

CACCGCGAAA ACCACCTGTA CTGGCGTTGA AATGTTCCGC A^SSc^ 

ACGAAGGCCG TGCT( 5GTGAG AACGTCGGCG TTCTGCTGCG tIStIt^Kg Ui 

, n CG J GAAGAAA TCGAACGTGG TCAGGTACTG gctaagccgg gctcca?c1S I™ 

2 0 GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATCCTGTCC SSgaSsAA^ t in 

GCGGCCGTCA CACTCCGTTC TTCAAAGGCT ACCGTCCgS GTTCtS?^ Snn 

CGTACAACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG nSSSSSS 7 °° 

GGTAATGCCG GGCGACAACA TCAAGATGGT ?G??AC?OTG Znn 

TCGC GATGGA CGACGGTCTG CGTTTC ° ATCCACCCGA 800 

25 826 

2) INFORMATION FOR SEQ ID NO: 56 

30 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

< A > ORGANISM: Entejrobacter hormaechei 
40 <B) STRAIN: ATCC 49162 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56 

GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCTATGC CTT*n*nnm 

45 TGAGCACATC CTGCTGGGTC GTCAGGTAGG CgScSSE SS^g? , 

^nS^ CAA ATGC GACATG GTTGATGACG AAGAGCTGCT f 22 

gagatggaag ttcgtgaact gctgtctcag tacgatttcc cSrSSSS" i 50 

CACCCCAATC GTTCGTGGTT CCGCGCTGAA jScGcSSS S^SSSS 2 °° 

^ iS™ 0QAAGM gaaaatcatc garctggctg gctSSSSJ 

50 CCAGAACCAG AGCGTGCGAT TGACAAGCCA TTCCTGCTPP rI*lnn*»?Z 300 

CGTATTCTCC ATCTCCGGTC GTGGtIcCGT SSSSSS 35 ° 

GCGGTATCAT CAAAGTAGGT GAAGAAGTTG AAATCg5??? SSSIJS^S 400 

ACTGCGAAGT CTACCTGTAC TGGCGTTGAA ATgS2SS 2SS555S^ 450 

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TrSJJSSSS AACTGCTGGA 500 

55 GTGAAGAAAT CGAACGTGGT CAGCTTcSS 2^ ATCAAAC 550 

CCACACACCA AGTTCGAATC SJSSSSSS 6 °° 
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5 



10 



2) INFORMATION FOR SEQ ID NO: 57 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



( vi ) ORIGINAL SOURCE: 
15 (A) ORGANISM: Enterobacter sakazaku 

(B) STRAIN: ATCC 29544 



rrrrrTATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACCCG 
^CGCTATCC TW-li^ii^ GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 
TGAGCACATC CTGCTGGGTC AAGAGCTGCT GGAACTGGTT 

tcSgagS gSgtScag tIcgacttcc CGGGCGACGA 
SSSSSc gttcgtgSS ctgctctgaa agcgctggaa ggcgacgctg 
S^SSSgc gaaaatcatc gagctggcag gtcacctgga ttcctacatc 
?S22Scgg Scgtgcgat tgacaagccg ttcctgctgc cgatcgaaga 
cStSttctcc atctcyggtc gtggtaccgt tgttaccggt cgtgtagagc 

rrrGTATCAT CAAGGTTGGT GAAGAAGTTG AAATCGTGGG CATCAAAGAC 
SSSSSSr CCACCTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 

SSScot gSggcgaga acgtaggtgt tctgctgcgt ggtatcaaac 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGC CGGG CTCCATCAAG 
SSSSeS AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG 
SSSctcac ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 
gScSSSa cSgactggc ACCATCGAAC TGCCGGAAGG cgttgagatg 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 
CGCGATGGAC GACGGTCTGC gtttcgcaat c 



40 2) INFORMATION FOR SEQ ID NO: 58 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 
45 (c) STRANDEDNESS: Double 

( D ) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 0 (vi) ORIGINAL SOURCE: 

(A ) ORGANISM: Enterococcus caeseliflavus 

(B) STRAIN: ATCC 25788 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58 

55 rnrrcCGATC TTAGTAGTAT CTGCTGCTGA TGGTCCTATG CCTCAAACAC 50 

g?gaaSca? CTTOTTATCA CGTAACGTTG GTGTACCATA CATCGTTGTT 100 

TTOTAAACA AAATGGATAT GGTTGATGAC GAAGAATTAC TAGAATTAGT 150 

?gSatggaa GTTCGTGACT TATTGTCAGA ATATGACTTC CCAGGCGACG 200 

60 ATGTTCCTGT AATCGCTGGT TCTGCTTTGA AAGCTCYTGA AGGCGATGCT 250 
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TCATACGAAG AAAAAATCAT GGAATTAATG GCTGCAGTTG ACGAATACGT 2 on 

TCCAACTCCA GAACGTGACA CTGACAAACC ATTCATGATG CCAGTCGAAG Hn 

nn^I^ TC ™*»CTGGA CGTGGTACTG TTGCTACAgS SSSS^ 400 

CGTGGACAAG TTCGCGTTGG TGACGAAGTT GAAATCGTTG GTATTGCTGA 450 

5 AGAAACTGCT AAAACAACTG TAACTGGTGT TGAAATGTTC CGTAAATTGT Jnn 

TAGACTATGC TGAAGCAGGG GATAACATTG GTGCATTGCT MGTGOTGTT tin 

GCTCGTGAAG ACATCCAACG TGGACAAGTA TTGGCTAAAG CTGGTACAAT 600 

CACACCTCAT ACAAAATTTA AAGCTGAAGT TTACGTTTTA aSaAAGAAG tin 

AAGGTGGACG TCACACACCA TTCTTCACTA ACTACCGTCC TC^S£2 ?oo 

TTCCGTACAA CTGACGTAAC TGGTGTTGTT GAATTACCAG AAGGAACTGA ?SS 

AATGGTTATG CCTGGTGATA ACGTAACAAT CGACGTTGAA TTgS^S2c 800 

CAATCGCTAT CGAAGACGGA ACTCGTTTCT CAATT "<*"tCACC 800 



10 



15 



25 



2) INFORMATION FOR SEQ ID NO: 59 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 826 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus cecorum 

(B) STRAIN: ATCC 43198 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59 

GGTGCTATCT TAGTAGTATC TGCTGCTGAT GGTCCTATGC CACAAACTCG 
TGAACACATT CTTTTATCAC GTAACGTTGG TGTTCCATAC ATCGIT™ 
TCTTAAACAA AGTTGATATG GTTGACGACG AAGAATTATT AoSJSSS 
35 GAAATGGAAG TACGTGACTT ATTAACTGAA TACGACTTCC C^SaSI 
TGTTCCTGTA ATCGCTGGTT CTGCATTAAA AGCTTTAGAA ScgJS™ 
^TACGAAGA AAAAATCTTA GAATTAATGG CTGCAGTTO^ SS^aSSJ 
CCAACTCCAG AACGTGACAA CGATAAACCA TTCATGATGC CAGTPGAAnJ 
CGTATTTTCA ATCACTGGTC GTGGTACTGT TGCTACAGGT 
GTGGACAAGT ACGTGTTGGT GACGAAGTTG AAATAGTTGG SSSSSS 
?^ TTCTA AAACAACAGT TACTGGTGTT GAAATGTTCC SISJSS? 
o^ TACGCT GA AGCTGGAG ACAACATCGG TGCATTATTA CGT^TGTC? 
CTCGTGAAGA TATCCAACGT GGTCAAGTAT TAGCTAAACC AGGTTCAATP 
ACTCCACATA CAAAATTCAC TGCTGAAGTG TaSgTTTtS JSS^S 
45 AGGTGGACGT PATAPTPPaT Tr-frr,™ ™» » <^AAAGAAGA 650 

700 
750 
800 
826 



50 
100 
150 
200 
250 
300 

40 GTGGACAAGT n^^^X ^t^tz^i ^'^TTGAAC 400 

450 
500 
550 

AGGTGGACGT JSnSSSS SKKSi SJSotS 

TC CGTACAAC TGACGTTACA GGTGTAGTTA ACTTACCAGA AGGTACTGAA 
^TTATGC CTGGTGATAA CGTAACTATG GAAGTT^ Inn 
AATCGCTATC GAAGACGGAA CTCGTT lAAitLACCC 800 



50 



2) INFORMATION FOR SEQ ID NO: 60 

(i ) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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™<%To™E™ , *™ neoaea ' dispar 

( B ) STRAIN: ATCC 51266 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60 

CGGCGCGATC TTCOTAGTAT CTOCTGCTOA TOGTCCTATG gC^gg 

10 tgaaatggaa gttcgtgact tattgtcaga atacga aggcgacgct 

ACACTCCAGT TATCGCAGGT JCAGCTTTGA AA Q ACGAATAT AT 

tcatatgaag aaaaaatctt agaattaaja. £ tgatg ccagtcgaag 

JcCAACTCCA GTTCGTGATA CTGACAAACC ATTCATGA^ TCGTGTTGA a 
i ^ ATGTATTCTC AATCACTGGT CGTGGTACiU u GTATCGCTGA 
15 CGTGGACAAG TTCGCGTTGG TGACGAAGTT GAAMgG CGTAAAT TGT 
AGAAACTGCT AAAACTACTG TAACAGGTGT JGAAA^ ACGTGGTGTG 
tSStTACGC TGAAGCTGGC ^AACATTG GTGCATTA^ ^ 
GCTCGTGAAG ATATCCAACG TGGTCAA^lA ii mTA aCTAAAGAAG 

20 cactccacat acaaaatttg cggcagaagt ttacgtti ttctac 
20 aaggtggacg tcatactcca ttcttcacta actaccg ^^^a 

TTC CGTACAA CTGACGTAAC AGGTGTTGTT GgJ^^ TTAA TCCACC 
SS SSS^TCT CAATC 



25 



2) INFORMATION FOR SEQ ID NO: 61 

.0 (i) SEQUENCE CHARACTERISTICS: 

30 (A) LENGTH: 835 bases 

i-o) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

<V±)< Sf G ^XVEnterococcus durans 
40 ( B ) STRAIN: ATCC 19432 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61 
CGGAGCTATC TTAGTAGTTT CTGCTGCTGA TGGCCCTATG C^TCAAACTC 

45 S5KK sssss S 

tgSatggaa gttcgtgact tattaacaga JTAcgaatT a aggcg 

ATGTTCCTGT AATCGCTGGT TCAGCTTT^A qq-jGCAGTTG ACGAATATAT 
TCATACGAAG AAAAAATCCT TGAATTjATG ^atqATG CCAGTTGAAG 
50 CCCAACTCCA ^CGTGACA ACGACAAACC FU Q( , TCGTGTTGAA 

ATGTATTCTC RATCACTGGT CGTGGTACTt, 1^ GTATCGCAGA 
CGTGGACAAG TTCGCGTTGG TGACGTTGTA CGTAAATTAT 

aSaacagct caaacaacag ttactggtgt tgaaatgt ct acgtqgtgtt 

TAGRCTACGC TGAAGCTGGA GA^AACATTG Gl CAGGTACAAT 
55 GCACGTGAAG ACATCCAACG TGGACAAGTT TO ACTAAAGAAG 

CACKCCTCAT ACAAAATTCT CTGCAGAAGT Al ACAATTCTAC 

AAGGTGGACG TCATACTCCA TTCTTCACTA AC^ AAGGAACTGA 

60 « S Ss sh?~ ~ cc 



/ 

50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
835 
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2) INFORMATION FOR SEQ ID NO: 62 

(i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 680 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 < A > ORGANISM: Enterococcus faecalis 

(B) STRAIN: R610 



20 



25 



30 



35 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 

AGTAGTTTCT GCTGCTGATG GTCCTATGCC TCAAACArrT na*™™™ 
TATTATCACG TAACGTTGGT GTAC C ATACA tSSJSS? S^SSS 

AAAATCTTAG AATTAATGGC TGCAGTTGAC GAATATAT^ ™t£5SS£ 250 

ACGTGATACT GACAAACCAT TCATGATGCC XgtcSJS ^EEP" 3 * 300 

TCACTGGACG TGGTACTGTT GCTACAGGCC S^S CTCAA 350 

CGCGTTGGTG ACGAAGTTGA AATCGTTGGT SSESSJS T??^AG TT 400 

AACAACTGTT ACAGGTGTTG AAATGTTCPr S^^™ AAACATCTAA 450 

AAGCAGGCGA CAACAtSct SSKggg SSJSS 10 500 

ATCGAACGTG GACAAGTATT AGCTAAACCA oSSSSS? ^F^^ 550 

AAAATTCAAA GCTGAAGTAT ACGTATTa5? SSJSffJ^ CTCCACA CAC 600 

ACACTCCATT CTTCACT^AC tStStc AAAAGAAQAA ^CGGACGTC 650 

"~ £T O A 



50 



55 



60 



50 
100 
150 



2) INFORMATION FOR SEQ ID NO : 63 

(i) SEQUENCE CHARACTERISTICS • 
40 (A) LENGTH: 68 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: .Enterococcus faecalis 

(B) STRAIN: R487 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 

iSESSS ESSE? KS2S £^ CACGT ^catatct 

ATGGATATGG TTOAlSeX AGAATTATTA SSSS™ ^AAACAAA 1Q q 

TCGTGACTTA TTATCAGAAT JSJ?SS? AAATGGAAGT 156 

TCGCAGGTTC TGCTTTGAAA GCTTTArfS ^^2 ATGAT GT TCCAGTTA 200 

AAAATCTTAG AATTAATGGC SSSSS p?*™™^ TTA TGAAGAA 250 

ACGTGATACT GACAAAc2a¥ ?SS5££§ SSS* 3 " 3 * 300 
TCACTGGACG TGGTACTGTT GCTACAGGCC SSSgSSSg 



50 
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m n^* is. aTPPTTGGT ATTAAAGACG AAACATCTAA 
CGCGTTGGTG ACGAAGTTGA AAJCGTTGGT GACTACGCTG 

SSSS2 SSS5 Egg SKUgg ™- 
ESSE SSK3S 5SS3|| SSSU ooc^coxc «o 

ACACTCCATT CTTCACTAAC TACCGTCCTC 



450 
500 
550 
600 



10 2) INFORMATION FOR SEQ ID NO: 64 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 685 bases 

(B) TYPE: Nucleic acid 
(c) STRANDEDNESS: Double 

lb ( D ) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(B) STRAIN: R482 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 

25 .^n^»rP rrr HAATGCC TCAAACTCGT GAACACATCC 

AGTAGTTTCT GCTGCTGACG gCCCAATGCC CTTGAACAAA 
TATTGTCTCG TCAAGTTGGT JTTCCTTACA ^ TTQ aaatGGAAGT 
GTAGACATGG TTGATGACGA AGAATTACTA GAATT GTTCCTGTAG 

tcgtgaccta ttaacagaat acgaattccc tggt ATACGAAGA a 

TTGCTGGATC AGCTTTGAAA GCTCTAGAA^ CAACTCCAGA 

aaaattcttg aattaatggc tgcagttgac ^acATCc £ tgttctcaa 

ACGTGACAAC GACAAACCAT JCATGATGCC AGTO^A^ TGGAC AAGTT 

ttactggacg tggtactgtt gctacaggtc gtgttga^ ^cttcaaa 

CGCGTTGGTG ACGAAGTTGA AGTTGTTGGT GACTACGCTG 

aacaacagtt actggtgttg aaatgttccg Jaaatt ttgc acgtgaagac 

AAGCTGGAGA CAACATTGGT GCTTlAUim- ~ rTRCAATCA CACCTCATAC 

SS T c? SS fcSS ««« «• 

JfflSSSS StSHtAAC TACCGTCCTC AATTT 



30 



35 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 



40 



45 



50 



2) INFORMATION FOR SEQ ID NO: 65 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(Vi) S^SJSSSSf ' snterocsccu,, flavor* 
55 ( B ) STRAIN: ATCC 49996 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65 

60 = = = = = 



50 
100 
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20 



2 ) INFORMATION FOR SEQ ID NO: 66 

( i ) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 636 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Enterococcus gallinarum 



35 



(B) STRAIN: R420 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 66 

TCCTATGCCT CAAACTCGTG AACACATCTT RTTiTrir-P-r 

TACCATACAT CGTTGTTTTC ¥?SSSaaI SSSJS? 
GAATTGCTAG AATTAGTTOa utop™ „£_?.£ TGATGACGAA 



50 

2) INFORMATION FOR SEQ ID NO: 67 

( i ) SEQUENCE CHARACTERISTICS • 
(A) LENGTH : 83 5 bases 
55 < B ) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 



(ii) MOLECULE TYPE: Genomic DNA 

172 



150 
200 
250 



TTCTTAAACA AAATGGATAT GGTTGATGAC GAAGAATTAC TAGAATTAPT 
?S^GGAA GTTCGTGACT TATTGTCAGA ATATGACTTC CCAGGCGACG 
ATGTTCCTGT AATCGCTGGT TCTGCTTTGA AAGCTCTTra irn™^ 
TCATACGAAG AAAAAATCAT GGAATTAATG iEESESS. 

5 TCCAACTCCA GAAC GTGACA CTGACAAACC SS3SS5S ScSJSJS U° 

ACGTATTCTC AATCACTGGA CGTGGTACTG TTGCTACaS CCGtItt^ 35 ° 

CGTGGACAAG TTCGCGTTGG TGACGAAGTT GAAATCotS £2KS2£ Hi 

AGAAACTGCT AAAACAACTG TAACTGGTGT TGAAATGTTC CCtIIw?^ 450 

TAGACTATGC TGAAGCAGGG gataacattg gtgcattgS SJJSSSSI 500 

10 GCTCGTGAAG ACATCCAACG TGGACAAGTA tSSSSSS £S?£2J 550 

CACACCTCAT ACAAAATTTA AAGCTGAAGT tScS™ 600 

aaggtggacg tcacactcca ttcttcacta actaccot?c 650 

TTCCGTACAA CTGACGTAAC TGGTGTTGTT GAATtSSp ISSISS? 700 

IS CCTGGTGA TA AMSTjJSK? SJSJSSS T^gS^S?? Inn 

15 CAATCGCTAT CGAAGACGGA ACTCG il^rATCCACC 80 0 



50 
100 
150 



OAJTT^TAG mSSUSi «?SS£S?? SSS S3£g& 
TGACTTCCCA GGCGACGATG TTCCTGTAAT CGCCOGTTCT SSSSJ 

. ■■■■■ i 

GCTAAAGCTG GTACAATCAC ACCTPATiri JtS™™ ACAAGTATTG 550 
TGTTTTGACA AAAGAAGAAG ™^ A ^ ^ES^ 6 CTGAAGTTTA 600 
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( vi ) ORIGINAL SOURCE: . ... 

(A) ORGANISM: Enterococcus hirae 

(B) STRAIN: ATCC 8043 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67 

CGGAGCTATC TTAGTAGTTT CTGCTGCTGA TGGTCCTATG CCTCAAACTC 
GTGAACATAT CCTAYTATCT CGTCAAGTTG GTGTTCC TAGAATTAGT 
TTCTTGAACA AAGTAGATAT GGTTGAC^AL r'pacgaaTTC CCTGGTGACG 
10 TGAAATGGAA GTTCGTGACT JATTAACAGA ^CGAATTC CCT ^ 
ATGTTCCTGT AGTTGCTGGT *CAGOTJ2£ ^?gCAGTTG ACGAATATAT 
TCATACGAAG AAAAAATCCT TGAATTGATG JCTGCA^ CCAGTCGAAG 
CCCAACTCCA GAACGTGACA ACGACAAACC ATTCATGATG 
ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG JC 
IS CGTGGACAAG TTCGCGTTGG TGACGTTGTA GAT^ CGTAAATTAT 
AGAAACAGCT CAAACAACAG TTACTGGT^l ACGTGGTGTT 
TAGACTACGC TGAAGCTGGA GACAACATTG GTGCTl CAGGTACAAT 
GCACGTGAAG ACATCCAACG TGGACAAGTT JTAGCTAAA ACAAAAGAAG 
CACACCTCAT ACAAAATTCT CTGCAGAAGT ATACG1£ ACAATTCTAC 
20 AAGGTGGACG TCATACTCCA JTCTTCACTA ACTACCU ^ GGAACTGA 

SSSS^S iSSKS g|agttgaa TTAATCCACC 

SSSctS CGAAAACGGT ACTAAATTCT CAATC 



25 



2) INFORMATION FOR SEQ ID NO: 68 

(i) SEQUENCE CHARACTERISTICS: 
, 0 (A) LENGTH: 835 bases 

( B ) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



,5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: m „„rti-ii 

(A) ORGANISM: Enterococcus xnundtia 

(B) STRAIN: ATCC 43186 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 

"""KS sssss ssssss sssss sssss 

GTQAACACAT CCTATTATCT ™J«"fiii GAAGAATTAC TTGAATTAGT 
45 TTC ^S S?? C G?Scc TATTAACAGA ATACGAATTC CCTGGTGACG 

s™E iiii sssss ssss^ 
^li Efil li-- ssss ssssss 

50 S t cSg s= ~ -gsssss ss 

S SSKSSEg Si SitS SSSS5 

55 ss gss^; =| II Esssssss 
EE1 IS liis s sess 

C^ATCGCTAT CGAAAATGGT ACTAAATTCT CAATC 



60 
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10 



2) INFORMATION FOR SEQ ID NO: 69 

( i ) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 83 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Enterococcus pseudoavium 
15 (B) STRAIN: ATCC 4 93 72 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69 

CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCTATG rPTr^ar&P 

20 ^^° ACAT CTTGTTATCT CGTAACGTTG GTgSStS SSSJJSf , nn 

TTCTTAAACA MATOaTaT P^rr^Amn, LZ-I.lt* <-ATCGYTGTA 100 

150 

200 

25 CCCAACACCA GTTCt^^ n^™^ AC GAATACAT "2 

350 
400 
450 

30 3CACGTGAAG A aTtpp^pp: rA ill^ i ACGTGGTGTT 55^ 

600 

TTCCGTACAA 6iS55nSS TGGTGTTGTT 52S5KSS ?S 

» ^° TAATG CCT ^TGATA ACGTAACTAT GGAAGt5S5 2 J 

35 CAATCGCGAT CGAAGACGGA ACTCGTTTCT CTATTC ttaatc CACC 800 

836 



TTCTTAAACA AAATGGATAT GGTTGATGAC SSSSS SS 
TGAAATGGAA GTTCGTGACT TATTGTCAGA ATACGATTTC 
JSJSJS£? T TATCGCT <*>T TCAGCYTTgK SScSSS AGgI^S 
TCATACRAAG AAAAAATCTT AGAATTAATG SCTGCTGTTG ACGAAtSS 
CGCAACACCA GTTCGTGATA CTGACAAACC ATTCATgSg CCaSJSSS 
ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCAACTGG raSSSJ? 
CGTGGACAAG TTCGCGTTGG TGACGAAGTT GAAATCGTAG CTATCgSgA* 
^^Sr GCT AAAACAACTG TTACAGGTGT TGAAATGTTC CGTAAaSSJ 

^ ga ^acgc tgaagcaggc gataacatcg gtgcattatt acgtogtSt? 
GCA S^ GAAG acatccaacg tggacaagta ttggctaaa? cagctcSX? 
cactccacat acaaaattct ctgcagaagt ttacgtttta aotSSJ? 
aaggcgggcg tcacactccg ttcttcacta actac?gtcc 

TTCCGTACAA CTGamTisr ta~™ „,„„_?_f C ? TCAGTTCTAC 



40 



2 ) INFORMATION FOR SEQ ID NO: 70 

( i ) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Enterococcus raffinosus 

(B) STRAIN: ATCC 49427 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70 

HIE Si St ~ — ™ ssss 
~ iiE SE Sil 

TCATACGAAG AAAAAATCTT AGAATTaIS ATGAATACAT 



174 
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10 



ccca^cc ™ S££™£ 

SfcS gEKSj ggrggo 

AGAAACTGCT AAAACAACTG TTACAGGTGT JGAAA^ ACGTGGTGTT 

tSSttacgc tgaagcgggc gacaacattg gtgcat^ cagcttca ^t 

GCACGTGAAG ACATCCAACG TGGACAA^IA u ACTAAAGAAG 
SSCACAT ACAAAATTCT CTGCAGAAGT JTACGXTT^ TCAGTTCTAC 
AAGGCGGACG TCATACTCCA TTCTTCACTA ^ CTACCAG AAGGTACTGA 
TTCCGTACAA CTGACGTAAC TGGTGTAGTT XTAATCCACC 

SSSSSS ACTCGTTTCT CTATT 



350 

400 

450 

500 

550 

500 

650 

700 

750 

800 

835 



1S 2) INFORMATION FOR SEQ ID NO: 71 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 83 5 bases 
in) TYPE: Nucleic acid 
c STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
25 (vi)ORIOINAL^URCE: Enterococcus saccharolyticus 

(B) STRAIN: ATCC 43076 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 71 
30 CGGCGCGATC TTAGTAGTAT CTGCTGCTGA TGGTCCTATG CCTCAAACTC 
GTGAACACAT CTTGTTATCT CGTAACGTAG T AGAATTAGT 
TTCTTAAACA AAATGGATAT ^TGATGAC J^xam^ CCAGGCGATG 
AGAAATGGAA GTTCGTGACT TATTATCAUJ TTTAQA AGGCGATCCA 

3 5 ACACTCCAGT TATTGCAGGT JCTGCTTT GCTGCAGTTG ACGAATATAT 

GTTTACGAAG AAAAAATCTT tv*rr ATTCATGATG CCAGTTGAGG 

CciScTCCA GAACGTGATA CTGAAAAACC MTCWffl^ TCGTGTTGAA 
ATGTATTCTC AATCACTGGT CGTGGTACTG TT^ q gtatcgacga 
CGTGGACAAG TTCGCGTTGG TGACGTTG1A ^g^^^ CG TAAATTAT 

4 0 AGAAACAGCT CAAACTACTG TAACAGGTGT TGAAA^ ^ 

TAGACTACGC TGAAGCAGGC GATAACAT^ MC CAGGAACAAT 

GCTCGTGAAG ACATCCAACG TGGACAA^IA ACTAAAGAAG 

ssssss gsg IS is =11 
45 « IS lass ssr" — 

50 2 ) INFORMATION FOR SEQ ID NO: 72 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

( B ) TYPE: Nucleic acid 
5b (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

60 

175 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
835 
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10 



15 



20 



25 



35 



40 



45 



50 



55 



(vi) ORIGINAL SOURCE • 

(A) ORGANISM: Enterococcus Bolitarius 

(B) STRAIN: ATCC 49428 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 

SS SS5SS SSSSS5 ?S CCAATGGG acaaactcgt 

CTTGAACAAA S553S SSSSS 6 * TCGTT ^TGTT ioo 

aaatggaagt acgtgatSa SSSSSt" gag ttagttg 150 

ACTCCAGTTA TTTCCGGTTC AGCTTTGAAA rr^ll^ C AGGAGATGAT 200 

ATATGAACAA AAAATTATGG SISgS? GCG ACGAAGA 250 

CAACTCCTGA ACGTGAcSS SaCaUSS JSJSESK SSSf*™ 300 

gtattttcaa ttacaggccg tggtactgtt g™ A SS ^^aagat 350 

CGGGACTATC AAAGTCGGCG ATGAAGTOgI SESKSS 2552**°° 400 

ACGTTAAAAA GACAACAGTT ACTGP tp t an a^Z-I GT ATTCATGAAG 450 

gactacgctg aagcaggcga tISSSS? J**™^ 000 taaattgttg 500 

TCGTGATGAT ATCGuSSSS JS5535E GTGGTGTTTC 550 

caccacatac aagattctct gctga^g??? a?S™?^ ggttcaatca 600 

GGCGGACGTC ATACTCCATT r«m£m??£ ATGTTTTGAC TAAAGAAGAA 650 

CCGTACAACT CTG^CA^ JJ™ J*™ 1 *™ "2 

TGGTAATGCC AGGTGATAAT GT A A raiTPP A ^~™S AGAA GGTACTGAAA 750 

GTCGCTATCG AAGAAGGaIJ tIT ATGTTGAATT AATCCACCCA 800 



2) INFORMATION FOR SEQ ID NO: 73 

(i) SEQUENCE CHARACTERISTICS- 
J0 (A) LENGTH: 665 bases 

(B) TYPE: Nucleic acid 
CO STRANDEDNESS : Double 
(D) TOPOLOGY: Linear 



60 2) INFORMATION FOR SEQ ID NO: 74 

176 



50 



( i i ) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE- 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: ATCC 25788 m 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 

SSSSSS £ggg£ 2SSSSSS KKS?" AAACACGGGA au 

ATCGAAGTCA SSSSgg SEES ~SS S ^ 

TCCTGTGATC aS£2S?£ SSSSS? GCGACGACAT 20S 

CTGAAGCAGC GATCTTAACG CTTATr™™ GAAGGG GACCCAGATG 250 

ACGCCAGAAC OmSSJS SSSSSS SSSSS A ^ AT ° CCA 3 °° 

CTTTTCGATC ACAGGACGGG GGACCGTTGC TCGAAGATGT 350 

GCATGGTAAA AGTCGGGGAT GAaStaSS? SEES™*? 0 ATCGATCGCG 4 00 

ACACAAAAAG CAGTCGTGAC AGGGr???^ ^S 00 ^ 1 " CAAACCTGAA 450 

CTTCGGAGAA GCTGGCGATA fJSSSS 01 AAACGATGGA 500 

GTGATGAAAT TGAACGTGGC CaSSa? S*a^ GG GGC ATCACCC 550 

CCGCATACGA AATTCCAAGC GGAAototJ? Z^^Z 0 * 00 T TCTATCACA 60 0 

CGGTCGCCAT ACCCC GGAAGTCTAT GTGTTGACAA AAGAAGAAGG 650 



50 



665 
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( i) SEQUENCE CHARACTERISTICS : 

( A ) LENGTH: 664 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
« <* 1) S? B, £ a 35£?' Enterococcus faeci- 

(B) STRAIN: ATCC 19434 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 
15 AATCTTAGTT ™GCAA CTGACGGTCC GATGCCTCAA ACACGTGAAC 

AACAAAGTTG S3£3£ |S| TTTCCCAGGC 3S2S2S 
SSaAGTTCGC GAGTTATTGA GCGAATATGG TTTCCCAGGC ^ qaGATGCT 

20 CTGTGATCAA AGGTTCCGCA TTAAAAGCAT i£ ATATCCCAAC 
GAAGCTGCTA TTATGGAATT G^^TACA ^ GAAGACGTCT 
ACCAGAACGT GATACAGATA J^CCATTACT ATT TGATCGTGGT 
TCTCAATTAC AGGTCGAGGA ACTGTTGCCT ^ AACCTGAAAC 
GCTGTTCGTG TCGGTGATGA GGTAGAGATC ™* ACGTTAGATT 

25 ACAAAAAGCA GTTGTAACAG GTGTAGAAAi ^ QGGG QATCCAACGT 
ACGGGGAAGC TGGGGATAAC JTAGGCGTGT Tj^Jgg^ CCATTAC TCC 
2SSSS SSSSS SS GTTGACAAAA GAAGAAGGTG 650 
CACGTCATAC TCCA 



30 



2) INFORMATION FOR SEQ ID NO: 



75 



^ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 666 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(Vi)( ?S GIN O^GASSS? : Enterococcu S flavescens 
45 ( B ) STRAIN: ATCC 49996 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75 
GCAATCTTGG TCGTATCAGC JACAGATGGC CCAATGCCAC AAACACGGGA 
50 GCATATTTTG CTTTCTCGTC AAGTGGGTGT JAAA^J,J A XTTAGTT GAA 
TGAATAAGAC GGACCTTGTC ^GATGACG JGTTGATCGA ^ 
ATGGAAGTCA GAGAATTGCT GACTGAATAT GATTTOU^ q^ccagATG 
TCCTGTGATC AAGGGCTCTG CGTTAAAAGC CTTGGAA^ ATATATCCCA 
CTGAAGCAGC GATCTTAACG CTGATGGATA CGGTAGATGA A 
55 ACGCCAGAAC GTGATACTGA CAAACCATTG TTGTTACC ATCGATCGCG 
CTTTTCGATC ACAGGACGGG ^gCGTTGC TTCTGOi^ cAAACCTGAA 
GCATGGTAAA AGTCGGGGAT JJA^GAAA JCGl aaaCGATGGA 
ACACAAAAAG CAGTCGTGAC AGGGGTAGAA ATGl 1 qqcaTCACCC 

60 gJSSSSS S2ESSSS A caIaaccagg ttctatcaca 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
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Sg^GcS jSSS"^ GGAAGTCTAT GTGTTGACAA AAGAAGAAGG 



2) INFORMATION FOR SEQ ID NO: 76 

< i ) SEQUENCE CHARACTERISTICS ■ 
(A) LENGTH: 667 bases 
10 <B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



25 



30 



40 



2) INFORMATION FOR SEQ ID NO : 77 

( i ) SEQUENCE CHARACTERISTICS • 

(A) LENGTH : 834 bases 

(B) TYPE: Nucleic acid 
f c > STRANDEDNESS: Double 

45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE- 
50 (A > ORGANISM: Ehrlichia canis 

(B) STRAIN: Florida 



55 



60 



650 
666 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Knterococcus gallinanm 

(B) STRAIN: ATCC 49573 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO- 76 

sssss ssssss ssssffi EEs EBB *» 
sssssss skes =E S iiil - 

TCTTTTCGAT TACTGGACGA GGAACAGTTG CTTcSSSS SSS^S** 6 350 

GGAACAGTTC GGGTAGGCGA TGAAGTAPAfl zlnn^-S? GATCG ATCGG 400 

AACCCAAAAA GCTGTAGTGA oS22SSf ??52^S£ TCAAACCTGA 450 

ACTTTGGGGA AGCCGgISS tcSgSSc JSS2S? 30 500 

CGTGACGAAA TTGAACGAGG ACAAGTGTTG AGGGATCACT 550 

ACCACATACA AAATTCCAAG CAGAACTtS TCTaTT^arn OTTCGATCAC 600 
35 GTGGTCGTCA TACACCA "-auaagttTA TGTATTGACG AAAGAAGAAG 



50 



650 
667 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77 

SSS3SS SSK5S SS bo 

a sssss 3 s sseje? & 0 

ATATAGATGT AGTTaSSgJ TC?££g?££ ?FiI?22™? CCTOG «3*ro 2 iS 
GGCTCAGGTG TGTGGAGTGA £££££ £££££ jSSSgS2 



250 
300 
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AAAAATAAGT TTACCAGTAA GAGAAAAJGA TAAGCCATTT ^AATGTCAA 
TAGAAGATGT GTTTTCAATA JCTGGAAGAG JTACAOT^ GQATT 
ATAGAAAGAG GAGTAATTAG AGTAGGGGAT AAAATA^ TTTCATAAAG 
GCGTGAGATA CAAAGTACAG TATGTACAGG gl^J GTTAAGAGGG 
CATTAGATGC AGGAGAAGCA GGGGATAATG CTGGAATA1 c^CCTGGACA 
ATAAAAAAAG AAGATGTAGA AAGAGGGCAA GTATL "J* TTGAAAAAAG 
GATACATTCA TATAAGAGAT TTAAGGCAGA g^^ATATA 
AAGAAGGAGG AAGACATACT ^CATTTTTCT £AAATTA CAGAAGGAGT 
SS SSSSS ATAATATAAA TATCGAAGTG AGTTTGGATA 
AGCCTG^GC StGATCAA GGATTGAGAT TTGC 



350 

400 

450 

500 

550 

600 

650 

700 

750 

800 

834 



15 



20 



2) INFORMATION FOR SEQ ID NO : 78 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

9 c (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coll 

(B) STRAIN: ATCC 23 511 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 

TTCCTGAACA AATGCGACAT GGTTGATGAC ™ CCGGGCGACG 
TGAAATGGAA GTTCGTGAAC TTCTGTCTCA ^ACGACTTC CC^^ 
35 ACACTCCGAT CGTTCGTGGT TCTGCTCTUA ^ ATTCTTAYAT 
GAGTGGGAAG CGAAAATCCT GGAACTGGCT °GCTT^^ CCGATCGAAG 
TCCGGAACCA GAGCGTGCGA TTGACAAGCC ^TCCTGCTG CC^^ 
ACGTATTCTC CATCTCCGGT CGJGGTACCG J^^GTTG GTATCAAAGA 
CGCGGTATCA TCAAAGTTGG TGAAGAAGTT GAAAT^J. J£ j^^q^qq 

CGTGAAGAAA TCGAACGTGG ^A^^^ patTCTGTCC AAAGATGAAG 

ssss us S Hsb S3SS 

45 gg^SS £™X TGTTACCCTG &TCCACCCGA 

TCGCGATGGA CGACGGT 



50 



2) INFORMATION FOR SEQ ID NO : 79 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 82 5 bases 
c-c (B) TYPE: Nucleic acid 

( C ) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 



(ii) MOLECULE TYPE: Genomic DNA 

179 
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25 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia fergusonii 

(B) STRAIN: ATCC 35469 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79 

CGATCCTGGT AGTTGCTGCG ACTGACGGCC CGATGCCGCA GACTrPTrar 
CACATCCTGC TGGGTCGTCA GGTAGGCGTT CCGTACATCA TCGTGTTCCT 
GAACAAGTGC GACATGGTTG ATGACGAAGA GCTGCTGGAA CTOotJSS 
TGGAAGTTCG TGAACTTCTG TCTCAGTACG ACTTCCCGGG SaSI^ 
CCGATCGTTC GTGGTTCTGC TCTGAAAGCG CTOgSSS 25S5SS 
GGAAGCGAAA ATCC TGGAAC TGGCTGGCTT CCTGGATTCT TACATTcSS 
AACCAGAGCG TGCGATTGAC AAGCCGTTCC TGCTGCCGAT OgS^Sto 
TTCTCCATCT CCGGTCGTGG TACCGTTGTT ACCGGTCGTG TAGaSSSS 
TATCATCAAA GTTGGTGAAG AAGTTGAAAT CGTTGGTATC AAAGAGACTC 
AGAAGTCTAC CTGTACTGGC GTTGAAATGT TCCGCAAACT GCTgSatS? 
GGCCGTGCTG GTGAGAACGT AGGTGTTCTG CTGCGTgSS 

agaaatcgaa cgtggtcagg tactggctaa gccgggcacc atcaag??gS 
acaccaagtt cgaatctgaa gtgtacattc tgtccaaaga tgaaggSg? 
cgtcatactc cgttcttcaa aggctaccgt ccgcagttct actc^cta? 

TACTGACGTG ACTGGTACCA TCGAACTGCC GGAAGGCGTA GaS5SSS 
TGCCGGGCGA CAACATCAAA ATGGTTGTTA CCCTGATCCA CCCgIS?S? 
AT13GACGACG GTCTGCGTTT CGCAA ^ ^CGATCGCG 800 



2) INFORMATION FOR SEQ ID NO: 80 



(i) SEQUENCE CHARACTERISTICS- 
30 (A) LENGTH: 829 bases 

(B) TYPE:. Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia hermannii 

(B) STRAIN: ATCC 33650 



825 



40 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80 

GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC cmmnnnn 
TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCgSS SSa??^S 50 
TCCTGAACAA ATGCGACATG GTTGATGACG 2S££S£S 

gagatggaag ttcgcgaact gctgtcccag tacgatttcc cggSSS? if ° 

CACCCCGATC GTTCGTGGTT CCGCGCTGAA AGCGCTGgS SSSrS f° 

agtgggaaga gaaaatcatc gaactggctg gctacSSS SSt1?Sp ™2 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC ^ATrrlSn* !°° 

cgtattctcc atctccggcc gtggtaccgt tgttatcg?? 5™??™?S 350 

GCGGTATCAT CAAAGTGGGT GAAGAAGTTG AAATcSJgS TA^rl^?? 400 
ACTGCGAAAT CAACCTGTAC CGGCGTTGAA aISttSgS AA^gISSS 
CGAAGGCCGT GCGGGCGAGA ACGTGGGTGT tctgctgcgt 2£5IS™?a£ 

SSSSS SBSKS =s sill ™ 

K3K3S SS25S5 22»~ SS 



450 
500 

55 CCKCACACCA AGTTCC^ATr ^ 71 ^ 1 ^f^lf * rT C CAT CAAG 

sssia S S oliE HH « 

CGCGATGGAC GACGGTCTGC GTTTCGCAA GTTACCCTGA TCCACCCGAT 8 00 

60 829 

180 
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2) INFORMATION FOR SEQ ID NO: 81 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

( vi ) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia vulnens 

15 ( B ) STRAIN: ATCC 33821 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81 

GAGTGGGAAG AGAAAATL^i ^X^rXnnn rTTrrTnrTG CCGATCGAAG 
25 CCCGGAACCA GAGCGTGCGA ™AO£GCC ^TTCCTGCTG CCGAT^^ 

iiii HIE 33 ssssss ssssaa 
3S S Iiii issss ss ssssss 

30 CGTGAAGAGA TCCAGCGTGG rATCCTGTCC AAAGACGAAG 

35 TCGCGATGGA CGACGG 



40 



2) INFORMATION FOR SEQ ID NO: 82 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 ( D ) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A ) ORGANISM: EuJbacterauni lentum 

(B) STRAIN: ATCC 43055 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82 
55 CGGCGCCTCC iwri^ ^nrrPTAC ATCGTGGTCT 100 



60 



rnrrGCCTCC TCGTTATCGC CGCCACCGAC GGCCCGATGG CCCAGACCCG 
^nn^nl^r PTGCTCGCCC GTCAGGTCGG CGTGCCCTAC ATCGTGGTCT 
SSSSSS GTGCGACATG GTCGACGACG AGGAGCTCCT CGAGCTCGTC 

SSSotSg ttcgcgagct gctcgactct tacgagttcc cgggcgacga 

S^o^A^r aTrrrCGGCT CCGCTTTGAA GGCCCTCGAG GGCGACAAAG 

SSSSS SgSStgg acgccgtcga ctcctacatc 



150 
200 
250 
300 



181 
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CCGACGCCGG AGCGCATGGT 
CACGATGACC ATCACCGGCC 
GTGGTACGCT GCATGTCAAC 
ACCCAGAACA CGGTCTGCAC 
5 CGAGGCTCAG GCCGGCGACA 
GCGAGGAGAT CGTTCGCGGC 
CCGCACACCG AGTTCGAGGG 
CGGCCGCCAC ACGCCGTTCT 
GCACGACGGA CGTGACGGGT 
10 GTCATGCCGG GCGACAACGT 
CGC CAGGAAG AGGGCTGCGT 



CGACAAGCCG TTCCTGATGG 
GCGGCACCGT TGCCACCGGT 
GACCCGCTGG AGATCGTCGG 
CGGTATCGAG ATGTTCCGCA 
ACATCGGCTG CCTGCTCCGC 
CAGGTTCTCT GCAAGCCCGG 
TCAGGTCTAC ATCCTGACGA 
TCGACGGCTA CCGTCCGCAG 
GTTGCCCACC TTCCCGAGGG 
GGAGATCAAG GGCGAGCTCA 
TCGCTAAC 



CCGTCGAGGA 350 

CGTGTGGAGC 400 

TATCAAGGAG 4 50 

AGCTGCTCGA 500 

GGTGTCAAGC 550 

TAGCGTGACC 600 

AGGAAGAGGG 650 

TTCTACTTCC 700 

CACCGAGATG 75 0 

TTCACCCGAT 80 0 
828 



15 2) INFORMATION FOR SEQ ID NO: 83 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Euba cterlum nodatum 

(B) STRAIN: ATCC 33 099 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 

*™ T ° TGGTTTGTGC AGCAACKGAC GGACCAATGC CTCAGACAAG 

AGAACATATC CTTTTGTCAA GGCAGGTAGG AGTGCCATAT ATCATCGTAT 

GT ^TGAYG AAGAgS?^ SScSgGtI 



50 

GAGATGGAAG TAAGAGAACT TCTCAGTGAG TATGAATTCC C^GGAGATGA 

300 
350 
400 
450 
500 
550 
600 
650 



GAGAATGGGC AGACAAGATT GTAGAGCTGA TGGAGGAAGT AGATAAATAC 
ATTCCTGAAC CAAAGAGAGA TAACGACAAA CCGTTCCTGA TGCCtS?I?a 
GGACGTATTC TCAATAACAG GAAGAGGAAC AGTAgSScA SSgrg™ 

aaagaggaat cctgaaggtc ggtgatgaag tagaaatcgt S^?2K? 

40 °^ GAGAA GAAAGGTAGT AGTAACGGGA GTTGAAATGT TCAGAAAGCT 
TCTGGATGAA GCAGAGACAG GAGACAACAT CGGAGCACTG CTGAGAgS^ 
TTCAGAGAAC RGAGATCCAG AGAGGTCAGG TATTGGCRG? Sc^aSS 
^™ CCCAC ATA CAAAGTT CAAGGGTCAG GTATATGTAC TgSgAA^Sa 
« A ^ GAGGA A GGCATACC3C CGTTCTTCAA YGGATACAGW CCACAGTTCT 
45 ACTTCAGAAC AACAGACGTA ACAGGAGATT TGCAGCTGCC GGAAGGARCA lln 
GAGATGTGCA TGCCGGGAGA TAATGTGGTA ATGAACRTCA GCCToSSSS flnS 
TCCGATTGCT ATAGAAGAGG GWCTGAGATT TGCCA GCCTGATCAC 800 

83 5 



2) INFORMATION FOR SEQ ID NO: 84 

( i ) SEQUENCE CHARACTER I STICS : 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



182 
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(B) STRAIN: ATCC 33852 
5 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 84 

GCGATCCTGG TTGTTGCTOC A^TGATGGC CCgggg AGACTCGTGA 
GCACATCCTG TTGGGTCGYC AG GTTGGC^i iv.w* AC TGGTAGAA 
TGAACAAATG CGACATGGTT gATGACGAAG AGCTGCTGGA 
10 ATGGAAGTTC GYGAACTTCT GTCTGCTTAC GAll ^ 

CCCAGTTGTT AAAGGTTCAG CGCTGAAAGC AL C TACATCCCA 
GGGAAGCTAA GATCATCGAG CTGGCTGGCC £^ TCGAAGACGT 
GAACCAGAGC GTGCGATTGA CAAJCCATTC CT GTAGAGCGCG 
ATTCTCCATC TCCGGTCGTG GTACCGTTGT ia CAAGGACACT 
15 STATCATCAA AGTTGGCGAA GAAGT^AA TCGTT TGCTGGACGA 
GTTAAGTCTA CTTGTACTGG CGTTGAAATG T ATCAAGC GTG 

AGGCCGTGCY GGTGAGAACG TTGGTGTTCT ^ AATCAAACCA 
AAGACATCGA ACGTGGTCAG G™GGCTA AA Q ATGAAGGC GG 

20 SgS 

SSSSES SSSES 3S33SE ACCCTGATCC ACCCAATCGC 
SmStoac GGTCTGCGTT TCGCAA 



2) INFORMATION FOR SEQ ID NO: 85 

( i ) SEQUENCE CHARACTERISTICS : 
^ (A) LENGTH: 828 bases 
JU (B ) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ^ GIN ^S : Franci S ella tularensis 

(B) STRAIN: LVS 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85 
TGGTGCTATT CTAGTATGTT CTGCTGCGGA T^TCCTATG CCA^gTC 
GTGAGCACAT TCTGCTTTCT CGTCAAGTTG ^TGTACCAAA tagagctagt 
TTCTTAAACA AGTGTGACAT ^GATGAT GAA CCAGGTGATG 
TGAGATGGAA ^TCGTGAGC T™GATCA GT*L AGGTGACGAA 
ACACTCCAGT TATTATGGGT TCAGCTUl ia £~ ATGACTATAT 
GCTTACGTTG AGAAAATTGT JGAGCTAGTT «MKTO» cCGATCGAAG 
TCCTGCTCCT GAGCGTGATA CTGAGAAGCC ATJlAi TCGTATTGAG 
50 ATGTATTCTC ^TTTCAGGT CGTGGTACTG TTOTjA^ GTATT 

CGCGGTGTAG TTAACGTTGG TGATGAA^iX ^tqttccGT AAGCTTTTAG 
AACTCAAAAA ACTACAGTAA JTGGTGTGGA AATCT TGGACTTAAG 
ATAGAGGGGA AGCTGGTGAT AACGTTGGTA GTTCAATTAA 
AGAGATGATG TTGAGCGTGG ^CAAgTATTA JGTAAG 
55 GCCACATACT ^TTTGAAG JTGAGGTTTA £GTA^ ATTCTACTTC 
GTGGTAGACA TACTCCATTC JTCAAGGGAT AX GTGTAGAAAT 
SSSSS SS ™SSSS TATCACTCTA ATTAACCCAA 
??gSaggat GAAGGGTTAC GTTTTGCA 



40 



45 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
826 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
828 



60 
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2 ) INFORMATION FOR SEQ ID NO: 86 



10 



35 



40 



50 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

! A J ORGANISM: Fusobacterium nucleatum subsp. polymorphism 
15 (B) STRAIN: ATCC 10953 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86 

CGGTGCTATC ATCGTAGKTG CTGCTACTGA TGGTC CGATG CCTCARACTC 
™^£ T OYTGCTGGCT CGTCAGGTAA ACGTWCCKAG ACTGGTTGTA 
TTCATGAACA AGTGYGACAT GGTAGACGAC GCTGAAATGY TGGAACTCGT 
TGAAATGGAA ATGCGTGAAC TGCTTTCAGC YTACGAATTC GAYGGYGACA 
ACACTCCKTT CATTCAGGGT TCTGCTCTTG GTGCRTTGAA YGGCGTTGAA 
AAGTGGGAAG AGAAGGTTAT GGANCTGATG GATGCTTGCG ACACTTGGAT 
25 TCCTTTGCCT CCGCGTGATA TTGAYAAACC GTTCTTGATG CCGgSg^G 
ACGTATTCTC AATCACTGGT CGTGGTACTG TAGCTACTGG TCGTATCGA? 
GCTGGTGTTA TCCATGTAGG TGACGAAGTT GAAATCCTCG CTT^CTct 
JSSS 1 ™^ TCTGTTGTA A CTGGTGTTGA AATGTTCCGC AAgSgOTGG 

ATCAAGGTGA agctggtgac aacgtaggty tgttgctccg tggtatcga? 

3 0 AAGAACGAAA TCAAACGTGG TATGGTTCTT TGTAAGCCCG ctSg^SSJ ton 

650 
700 



2) INFORMATION FOR SEQ ID NO : 8 7 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gemella haemolysans 

(B) STRAIN: ATCC 10379 



55 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 



ACCTCACTCT AAGTTCAAAG CTTCT.TCTA £5£££S ^SSSS 
GTGGTCGTCA CACTCCGTTC CACAACAAAT ACCGTCCTCA JS^SiS 
CGTACTATGG ACTGTACAGG TGAAATCWCT CTTCCGGAAG SJSSSS 
GGTAATGCCT GGTGATAACG TAGAAATCAC TGTAGAACTG ATCTACcJSg 
TAGCATTGAA CGTAGGTTTG CGTTTCGCT ATCTACCCGG 800 



829 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87 

CTATCTTAGT AATCGCTGCT ACAGATGGAC CAATGGCTCA AACTCGTPAr 
J ATCTCGTAA CGTTGGAGTA CCAAAAaJS TOctSct? 
AAACAAATGT GATATGGTTG ATGACGAAGA GTTATTAGAA TTAGTTGAAa , 2n 

TGGAAGTTCG TGAACTATTA TCTGAATACG GATTCGACGG Jpat™*^ 150 
CCAGTAATCA AAGGTTCTGC TCTTAAAGCT CTTCaSS? J^n^™ 200 
60 AGAAAAAGCT ATCATCGAAT TAATGGAAAC AGTTGACGAA TACATCCCAA ll° 0 

184 
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10 



apcaTTCA TGATGCCAGT TGAGGACGTA 
CTCCAGAACG TGATAACGCT AAAJCATTCA TGA TTGAACGTGG 
TTCTCAATCA CAGGTCGTGG TACAGTTGCT ACTS ACTGAAGAAC 
ACAAGTTAAA GTTGGAGACG TAGTAGAAA^ xGTTCCGTAA ATTATTAGAT 
CAGCTTCAAC TACTGTAACA ^GTTGAAA TG GTGTTGCTCG 
TACGCTGAAG CAGGAGATAA CAJCGGTGCA ^ ACAATCACTC 
TGAAGACATC GAACGTGGAC ^TTTAGC ^ agAAGAAGGT 
CACACACTCA ATTCGTAGCT GA ^TGTA^ pp^rCTCAAT TCTACTTCCG 

SSgtcaca ctccattctt cacaaactac cgtcctcaat actgaaatgg 

T T ^S GGATAACGTA ZSSSSSZ TAGAACTTAT TTCTCCAATC 

SSSSSag AAGGAACTCG tttctcaa 



350 

400 

450 

500 

550 

600 

650 

700 

750 

800 

828 



15 



2) INFORMATION FOR SEQ ID NO: 88 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 823 bases 
TYPE: Nucleic acid 
STRANDEDNESS : Double 



(B) TYPE: Nucleic acid 
on (C ) STRANDEDNESS: Dcr^ 

20 (D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

25 ivilom™^^^, morti Jloru» 
(B) STRAIN: ATCC 27824 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88 

30 TCTTAGTAAT CGCTGCTACA OWgg™ KSSSS 

ATCCTATTAT CTCGTAACGT TGGAGTACCT £AAA^ GAAATGQ 
CAAATGTGAT ATGGTTGATG ACGAAGAGTT ATTAGAA1 X ACCA 
AAGTTCGTGA ACT ATTAT CT GAATACGGAT ™££GA^ CAGATGCTGA 

3 5 GTAATCAAAG GTTCAGCTCT TAAAGCTCTT GAAG^ ATC CCAACTC 

Laagctatc atcgaattaa tggaaacagt tgacgagtac ggacgtgttc 

CAGAACGTGA TAACGCTAAA JCATTTATGA TGCLAU TQGACA 
TCAATCACAG GTCGTGGTAC AGTTGCTACT ™ QAAGAACCAG 
AGTTAAAGTT GGTGACGTAG ^GAAATCGT TGGATTAACT attagattac 
40 CTTCAACTAC TGTAACAGGT ^TGAAATGT T TGCTCGTGA 
GCTGAAGCAG GAGATAACAT gGTGCATTA TTAC ^ ATCAC TCCAC 
AGATAT CGAA CGTGGACAAG ™AG^GC T ^ AGAAGGTGGA 

ATACTCAATT CGTAGCTGAT ^TGTACGIAX ACTTCCGTAC 
CGTCACACTC CATTCTTCAC ^TACCGT JCACAA^ 

45 SSSSSS ctSSS I™ AACTTATTTC TCCAATCOCT 
ATCGAAGAAG GAACTCGTTT CTC 



50 



2) INFORMATION FOR SEQ ID NO: 89 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

( B ) TYPE: Nucleic acid 
" (c) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 



(ii) MOLECULE TYPE: Genomic DNA 

185 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
823 
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(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Haemophilus actinomycetemcomitans 

(B) STRAIN: ATCC 33384 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89 



GCTATCTTAG 

GCACATCTTA 

TAAACAAATG 

ATGGAAGTTC 

CCCAATCGTA 

GGGAAGAAAA 

GAACCTGAGC 

GTTCTCTATC 

GTATCATCCG 

GCAAAAACCA 

AGGTCGTGCG 

AAGAAATCGA 

CACACTGACT 

TCGTCATACT 

CAACTGACGT 

ATGCCTGGCG 

GATGGACCAA 



TAGTAGCAGC 
TTAGGTCGCC 
CGACATGGTA 
GTGAACTTCT 
CGCGGTTCTG 
AATC CTTGAA 
GTGCTATCGA 
TCCGGTCGTG 
TACCGGTGAT 
CCGTAACCGG 
GGTGAAAACA 
ACGTGGTCAG 
TCGAATCTGA 
CCATTCTTCA 
AACCGGTACT 
ATAACATCAA 
GGTTTACGTT 



AACCGACGGT 
AAGTAGGTGT 
GATGACGAAG 
TTCTCAATAT 
CATTAAAAGC 
TTAGCAAACC 
CCAACCGTTC 
GTACCGTAGT 
GAAGTTGAAA 
TGTTGAAATG 
TCGGTGCATT 
GTATTGGCGA 
AGTGTACGTA 
AAGGTTACCG 
ATCGAGTTAC 
AATGACCGTA 
TCGCTATCG 



CCTATGCCAC 
TCCTTACATC 
AGTTATTAGA 
GACTTCCCGG 
GCTTGAAGGC 
ATTTAGATAC 
CTTCTTCCAA 
AACGGGTCGT 
TCGTGGGTAT 
TTCCGTAAAT 
ATTGCGTGGT 
AAC CGGGGTC 
TTGTCCAAAG 
TCCACAATTC 
CTGAAGGCGT 
TCCTTAATTC 



AAACTCGTGA 50 

ATCGTATTCT 100 

ATTAGTTGAA 150 

GCGATGACAC 2 00 

GATGCCGCAT 250 

TTACATCCCG 3 00 

TTGAAGATGT 350 

GTTGAGCGCG 400 

CAAACCGACT 450 

TACTTGACGA 50 0 

ACTAAACGTG 55 0 

AATCACCCCG 600 

AAGAAGGTGG 65 0 

TATTTCCGTA 700 

GGAAATGGTT 75 0 

ACCCAATTGC 800 
829 



2) INFORMATION FOR SEQ ID NO : 90 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus aphrophllus 

(B) STRAIN: ATCC 33389 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90 



TGGTGCTATC 

GTGAGCACAT 

TTCTTAAACA 

TGAAATGGAA 

ATACACCAAT 

GAATGGGAAG 

TCCTGAGCCA 

ACGTGTTCTC 

CGTGGTATCA 

GACTGCGAAA 

ACGAAGGTCG 

CGTGAAGAAA 

TCCGCACACT 

GTGGTCGTCA 

CGTACAACTG 

GGTTATGCCT 

TCGCGATGGA 



TTAGTAGTAG 

CTTATTAGGT 

AATGCGACAT 

GTTCGTGAAC 

CGTACGTGGT 

AAAAAATCCT 

CAACGTGCTA 

TATCTCCGGT 

TCCGTACCGG 

ACTACCGTAA 

TGCAGGTGAA 

TCGAACGTGG 

GATTTCGAAT 

TACTCCATTC 

ACGTAACCGG 

GGCGATAACA 

CCAAGGTTTA 



CAGCAACTGA 

CGCCAAGTAG 

GGTAGATGAC 

TTCTTTCTCA 

TCTGCATTAC 

TGAATTAGCA 

TCGACCAACC 

CGTGGTACTG 

TGATGAAGTT 

CCGGTGTTGA 

AACATCGGTG 

TCAAGTATTG 

CTGAAGTGTA 

TTCAAAGGTT 

TACTATCGAG 

TCAAAATGAC 

CGTTTCGCTA 



TGGTCCTATG 
GTGTTCCTTA 
GAAGAGTTAT 
ATATGACTTC 
AAGCGTTAAA 
AACCACTTAG 
GTTCCTTCTT 
TAGTAACAGG 
GAAATCGTAG 
AATGTTCCGT 
CATTATTACG 
GCTAAACCGG 
CGTATTATCC 
ACCGTCCACA 
TTACCGGAAG 
TGTATCCTTA 
TCG 



CCACAAACTC 50 

CATCATCGTA 100 

TAGAATTAGT 150 

CCGGGTGATG 2 00 

CGGCGTTGCA 250 

ATACTTACAT 300 

CCAATTGAAG 350 

TCGTGTTGAG 4 00 

GTATCAAACC 450 

AAATTACTTG 50 0 

TGGCACTAAA 550 

GCTCAATCAC 600 

AAAGAAGAAG 650 

ATTCTATTTC 7 00 

GCGTGGAAAT 7 50 

ATCCACCCAA 8 00 
833 



186 
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2) INFORMATION FOR SEQ ID NO : 91 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 815 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



10 



40 



(ii) MOLECULE TYPE: Genomic 



DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus ducreyi 

15 (B) STRAIN: DSM 8925 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91 
GAGTGGGAAG AAAAAATCAT TTTfTTATTA CCAATCGAAG 

25 SS3?SS 5SS 1°™? 

Eli ssssss sssss sssss ssssss 

3 5 TCGCGATGGA CGAAG 



2) INFORMATION FOR SEQ ID NO: 92 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus haemolytacus 

(B) STRAIN: ATCC 333 90 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92 

„ TGGTGCTATC TTAGTAGTAG CAGCAACTGA TGGTCCAATG CCACAAACTC 
g?2agcacat CTTATTAGGT CGCCAAGTAG GTGTTCCATA CATCATCGTA 
??2otaaSa AATGCGACAT GGTAGATGAC GAAGAGTTAT TAGAATTAGT 
IgaaatISaa gttcgtgaac TTCTTTCTCA ATATGACTTC CCAGGTGACG 
ATACACCAAT CGTAC GTGGT TCTGCATTAC AAGCATTAAA TGGCGTAGCA 

so SStSx!^ aaaaaatcct tgagttagca aaccacttag ATACTTACAT 

187 



50 
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CCCAGAACCA GAGCGTGCAA TTGACCAACC GTTCCTTCTT CCAATCGAAG 350 

ATGTGTTCTC AATCTCAGGT CGTGGTACAG TAGTAACTGG TCGTGTAGAA 4 00 

CGTGGTATCA TCCGTACTGG TGATGAAGTA GAAATCGTAG GTATCAAAGA 450 

TACAGCAAAA ACTACTGTAA CGGGTGTTGA AATGTTCCGT AAATTACTTG 5 00 

5 ACGAAGGTCG TGCAGGTGAA AACATCGGTG CATTATTACG TGGTACCAAA 55 0 

CGTGAAGAAA TCGAACGTGG TCAAGTATTA GCGAAACCAG GTTCAATCAC 600 

GCCACACACT GACTTCGAAT CAGAAGTTTA TGTATTATCA AAAGAAGAAG 650 

GTGGTCGTCA TACTCCATTC TTCAAAGGTT ACCGTCCACA ATTCTATTTC 7 00 

CGTACAACTG ACGTAACTGG TACTATCGAG TTACCAGAAG GCGTAGAAAT 750 

10 GGTAATGCCA GGCGATAACA TCAAGATGAC AGTAAGCTTA ATCCACCCAA 800 

TCGCGATGGA CCAAGGTTTA CGTTT CGCAA ' 



15 2 ) INFORMATION FOR SEQ ID NO: 93 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus parahaemolytlcus 

(B) STRAIN: ATCC 10014 



30 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 



GACGAAGGTT TACGTTTTGC GATT 



2 ) INFORMATION FOR SEQ ID NO: 94 



( i ) SEQUENCE CHARACTERI STICS : 
(A) LENGTH: 833 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 



(ii) MOLECULE TYPE: Genomic DNA 



830 



TCTTAGTAGT AGCAG CAACA GACGGTCCAA TGCCACAAAC TCGTGAGCAC 50 

ATCTTATTAG GTCGCCAAGT AGGTGTTCCA TACATCATCG TATTCTTAAA 10 0 

CAAATG CGAT ATGGTTGACG ATGAAGAATT ATTAGAATTA GTTGAAATGG 150 

AAGTGCGTGA ACTTCTTTCA CAATATGACT TCCCAGGTGA TGACACGCCA 200 

35 GTAGTACGTG GTTCAGCGTT ACAAGCGTTA AACGGCGTAG CAGAGTGGGA 250 

AGAAAAAATT CTTGAATTAG CAAACCACTT AGATACATAC ATCCCAGAGC 3 00 

CAGAGCGTGC GATTGATAAA CCATTCTTAT TAC CAATCGA AGACGTATTC 3 50 
TCAATCTCAG GTCGTGGTAC AGTAGTAACA GGTCGTGTTG AGCGTGGTAT 
CATCAAAGCG GGTGAAGAAG TTGAAATCGT AGGTATCAAA GACACTGCGA 
4 0 AAACAACAGT AACTGG CGTG GAAATGTTCC GTAAATTATT AGACGAAGGT 

CGTGCGGGTG AAAACGTTGG TGCATTATTA CGTGGTACAA AACGTGAAGA 55 0 

AATCGAACGT GGTCAAGTGT TAGCGAAACC AGGTACAATT ACACCACACA 600 

CAGACTTCGA ATCAGAAGTG TAC GTATT AT CAAAAGAAGA AGGTGGTCGT 650 

CACACTCCAT TCTTCAAAGG TTACCGTCCA CAATTCTACT TCCGTACAAC 700 

4 5 TGACGTAACT GGTACTATTG AATTACCAGA AGGCGTAGAA ATGGTAATGC 750 

CAGGCGATAA CATCAAAATG ACAGTATCAT TAATCCACCC AATCGCGATG 800 



400 
450 
500 



824 



188 
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M)t S aa SiSSS: Haemophilus parainfluenza* 

(B) STRAIN: ATCC 7901 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 
TGGTGCTATC TTAGTAGTAG CAGCAACTGA TGGTCCTATG gACAAACTC 
GTGAGCACAT CTTATTAGGT CGCCAAGTAG GTGWCCTg TAGAATTAGT 
TTCTTAAACA AATGCGACAT ^TAGATGAC CCGGGTGACG 
TGAAATGGAA GTTCGTGAAC JTCTTTCTCA CGGCGTWGCA 

ATA SSSS5 aaaaaatcS tgSSSct AGCCACTTAG ATTCTTACAT 

GAATGGGAAG AAAAAATCUl t^^Xt\Xnn pTTr , TT CCAATCGAAG 

TCCTGAGCCT CAACGTGCTA TCGACCAACC ^CCTTCTT CCAATC^ 
ACGTGTTCTC JATCTCCGGT CGTGGTACAG JAGT^J GTATCAAAC C 
CGTGGTATCA TCCGTACCGG TGATGAA^i i AAATTACTTG 
GACTGCGAAA ACTACCGTAA £CGGTGTTGA ^GTTCC TGGTACYAAA 
ACGAAGGTCG TGCAGGTGAA AACATCGGTG GTTCAATCAC 

CG l^r^ SS™££ ctSaaSSa 2gtat^a?cc AAAGAAGAAG 
^^act ?Scca??c ttcaaaggtt accgtccaca attctatttc 
gtggtcgtca tactccattc ttaccggaag gcgtggaaat 

SSSSS? ggtgataaca SS Statcctta atccacccaa 
t^gcSatgga ccaaggttta cgtttcgcta tcg 

2) INFORMATION FOR SEQ ID NO: 95 

(i) SEQUENCE CHARACTERISTICS: 
30 ( A ) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi ) ORIGINAL^URCE : Haemoph . ius paraphrophiIus 
(B) STRAIN: ATCC 29241 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95 

= s ss ssssss ssssss sasss 

S^SSI Stgcgacat ggtagatgac gaagagttat tagaattagt 

45 TS iii llii ? T ™s sssss 
Eil ssss ss 

- ?SSSS£ SSS^S 

CGTGGTATCA ^^^^TT CCGGTGTTG A AATGTTCCGT AAATTACTTG 

lili 1IH bsss ssss = 
55 iHii ssssss ssssss ssss &gss| 

lill? SSSS^S SSSS52E ^ T cS 

TCGCGATGGA CCAAGGTTTA CGTT 



60 

189 
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2) INFORMATION FOR SEQ ID NO: 96 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus segnis 
15 (B) STRAIN: ATCC 333 93 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96 

GCTATCTTAG TAGTAGCAGC AACTGATGGT CCTATGCCAC AAACTCGTGA 50 

2 0 GCACATCTTA TTAGGTCGCC AAGTAGGTGT TCCTTACATC ATCGTATTCT 100 

TAAACAAATG CGACATGGTA GATGACGAAG AGTTATTAGA ATTAGTTGAA 150 

ATGGAAGTTC GTGAACTTCT TTCTCAATAT GACTTCCCAG GTGATGATAC 2 00 

TCCAATCATT CGTGGTTCTG CATTACAAGC GTTAAACGGC GTAGCAGAAT 250 

GGGAAGAAAA AATCCTTGAA TTAGCT CAAG CATTAGATAC TTACATTCCT 3 00 

2 5 GAACCTGAGC GTGCAATCGA CCAACCGTTC CTTCTTCCAA TTGAAGACGT 350 

GTTCTCAATC TCTGGTCGTG GTACTGTAGT AACAGGTCGT GTAGAGCGTG 4 00 

GTATCATCCG TACCGGTGAT GAAGTTGAAA TCGTTGGTAT CAAACCAACT 450 

GCGAAAACAA CCGTAACCGG TGTTGAAATG TTCCGTAAAT TACTTGACGA 500 

AGGTCGTGCA GGTGAAAACA TCGGTGCATT ATTACGTGGT ACTAAACGTG 550 

3 0 AAGAAATCGA ACGTGGTCAA GTATTAGCGA AACCGGGTTC AATCACTCCA 600 

CACACTGACT TCGAATCTGA AGTGTACGTA TTATCTAAAG AAGAAGGTGG 650 

TCGTCATACT CCATTCTTCA AAGGTTACCG TCCACAATTC TATTTCCGTA 700 

CAACTGACGT AACCGGTACT ATCGAGTTAC CGGAAGGCGT GGAAATGGTT 750 

ATGCCTGGCG ATAACATCAA AATGAC CGTA TCCTTAATCC ACCCAATCGC 8 00 

3 5 GATGGACCAA GGTTTACG 81 g 



40 



2) INFORMATION FOR SEQ ID NO: 97 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 763 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 0 (A) ORGANISM: Hafnia alvei 

(B) STRAIN: ATCC 1333 7 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97 

55 CGGCGCTATC CTGGTTGTTG CTGCGACTGA CGGCCCTATG CCTCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTTG GCGTTCCTTW CATCATCGTR 10 0 

TTCCTGAACA AATGCGACAT GGTTGATGAT GAAGAGC TGC TGGAACTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCWGGYGATG 200 

ATACTCCAAT CATCCGTGGT TCTGCTCTGA AAGCGCTGGA AGGYGMACCT 2 50 

60 GAGTGGGAAG CTAAGATCGT AGAACTGGCT GAAACTCTGG ATTCTTACAT 300 

190 
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YCCACARCCA GAACGTGCTA TCGAYAAGCC ™GCTG CCAATCGAAG 
ACGTATTCTC TATCTCTGGC JGTGG^CWG QA 
CGCGGTATCG TTAAAGTTGG TGAAGAAGTT JJRATCGTT QCTGG 
TAC CGTTAAA TCAACTTGTA CCGGCGTTGA ^GTIC TGGTAT CAAG 
5 ACGAAGGTCG TGCAGGCGAG ^GTTGGTG TTCTGCT^ qytcYATCAA 
CGTGAAGACA TCGAACGTGG TCAG^i iu TATTCTGAGC AAAGATGAAG 
GCCACACACC AAGTTCGAAT CAGAAGTTTA TATTCTGA^ 

SS£££ SSSSSS ?IcC^CGAA TTGCCAGAAG GCGTGGAAAT 750 
10 GGTAATGCCA GGC 



350 
400 
450 
500 
550 
600 
650 
700 



15 



20 



2) INFORMATION FOR SEQ ID NO: 98 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: . . ^ 

25 (A ) ORGANISM: Kingella Jcingae 

( B ) STRAIN: ATCC 23330 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98 

50 

30 CGGCGCAATC TTGGTA'ruxx ^ - - > — — ^CTTA TAT CAT C GT A 100 

150 
200 
250 
300 

3 5 GCATACAAAG AAAAAAl^ t^^^nn aTTPTTGTTG CCAATCGAAG 3 50 

400 
450 
500 
550 

4 0 ACGAAGGTCA AGCX^x^x ^^aarPAG GTTCTATCAC 600 

650 
700 
750 
800 

4 5 GGTTATGCCA CitiULxftA/^uvj 828 



ssssss us US iii? ™il 

TTC tI^ iESSSSS SSSSESE SSSmttt CCAGGCGACG 

TGAAATGGAA ATCCGTGACT ^ii^hi^ GYG CATTGGA AGGCGACGCT 

SSSSKS ESSSS S|5SJ|S SS™ ? t K ™ 

sssss s^sss? SS£ SSSSSS 

S ibsss ssssas ss ss 
Si ss ssssss sss 

SSrawE ctgaagtgta tgtgttgagc aaagaagaag 

TCCGCACACT TTCGCTAACT ACCGCCCACA ATTCTACTTC 

Eii IS ssss sssss 



T CGCTAGGAA AACGGTTTGC GTTTTGCG 



2) INFORMATION FOR SEQ ID NO: 99 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Klebsiella ornithinolytica 

(B) STRAIN: ATCC 31898 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99 

GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAGCTGGTT 150 

GAAATGGAAG TCCGTGAGCT GCTGTCTCAG TACGACTTCC CGGGCGACGA 2 00 

CACGCCGATC GTTCGTGGTT CCGCTCTGAA AGCGCTGGAA GGCGAAGCAG 250 

ACTGGGAAGC GAAAATCATC GAACTGGCTG GCTACCTGGA TTCTTACATC 3 00 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 3 50 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

GCGGTATCAT CAAAGTGGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAG 450 

ACTGCGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAAC 55 0 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CTCTATCAAG 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACAACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

GTCATGC CGG GCGACAACAT CAAAATGGTT GTTACCCTGA TTCACCCGAT ftnn 

CGCGATGGAC GATGGTCTGC GTTTCGCA 32Q 



2) INFORMATION FOR SEQ ID NO: 100 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 749 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 33496 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100 

GATGCCGCAG ACTCGTGAGC ACATCCTGCT GGGTCGTCAG GTAGGCGTTC 
CGTACATCAT CGTGTTCCTG AACAAGTGCG ACATGGTTGA TGACGAAGAG 
CTGCTGGAAC TGGTTGAAAT GGAAGTTCGT GAACTTCTGT CTCAGTACGA 
TTTCCCGGGC GACGACACTC CGATCGTTCG TGGTTCTGCT CTGAAAGCGC 
TGGAAGGCGA CGCWGAGTGG GAAKCKAAAA TCAT CGAACT GGCTGGCTTC 
CTGGATTCTT ACATTCCGGA ACCAGAGCGT GCGATTGACA AGCCGTTCCT 
GCTGCCGATC GAAGACGTAT TCTCCATCTC CGGTCGTGGT ACCGTTGTTA 
CCGGTCGTGT AGAGCGCGGT ATCATCAAAG TTGGCGAAGA AGTTGAAATY 
GTTGGTATYA AAGACACTGC TAAGTCTACC TGTACTGGCG TTGAAATGTT 
CCGCAAACTG CTGGACGAAG GCCGYGCTGG TGAGAACGTT GGTGTTCTGC 
TGCGTGGTAT CAAACGTGAA GAAATCGAAC GTGGTCAGGT ACTGGCTAAG 
CCGGGCTCTA TCAAGCCGCA CACCAAGTTC GAATCTGAAG TTTATATCCT 
GTCCAAAGAC GAAGGCGGCC GTCACACTCC GTTCTTCAAA GGCTACCGTC 
CGCAGTTCTA CTTCCGTACA ACTGACGTGA CTGGCACCAT CGAACTGCCG 
GAAGGCGTAG AGATGGTTAT GCCGGGCGAC AACATCAAAA TGGTTGTTA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
749 



2) INFORMATION FOR SEQ ID NO: 101 
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10 



15 



20 



25 



30 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: . 

(A) ORGANISM: Klebsiella planticola 

(B) STRAIN: ATCC 33531 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101 

TGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG 
GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA 
TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC 
TGAAATGGAA GTTCGTGAGC TGCTGTCTCA GTACGACTTC 
ACACTCCGAT CGTTCGTGGT TCCGCTCTGA AAGCGCTGGA 
GACTGGGAAG CGAAAATCAT CGAACTGGCT GGCTACCTGG 
CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG 
ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG 
CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATC GTTG 
TACTGCTAAR TCWACCTGTA CYGGCGTTGA AATGTTCCGC 
ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG 
CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG 
GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC 
GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA 
CGTACAACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG 
GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG 
TCGCGATGGA CGACGGTCTG CGTTTCGCAA 



CCGCAGACTC 50 

CATCATCGTG 100 

TGGAACTGGT 150 

CCGGGCGACG 2 00 

AGGCGAAGCA 2 50 

ATTCTTACAT 3 00 

CCTATCGAAG 3 50 

TCGTGTAGAG 4 00 

GTATCAAAGA 450 

AAACTGCTGG 500 

TGGTATCAAA 550 

GCTCTATCAA 600 

AAAGACGAAG 650 

GTTCTACTTC 70 0 

GCGTAGAGAT 750 

ATTCACCCRA 800 
830 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 102 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. ozaenae 

(B) STRAIN: ATCC 11296 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102 

CCTGGTTGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACT CGTGAGCACA 
TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 
AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TTGAGATGGA 

55 £S£?2g?gaa CTGCTGTCTC AGTACGATTT CCCGGGCGAC GACACCCCGA 200 

??GTT?GTGG TTCTGCTCTG AAAGCGCTGG AAGGCGACGC AGAGTGGGAA 250 

GCGAAAATCA TCGAACTGGC TGGCCACCTG GATACCTATA TCCCGGAACC 3 00 

SSS?gS ATTGACAAGC CGTTCCTGCT GCCGATCGAA GACGTATTCT 350 

CCATCTCCGG TCGTGGTACC GTTGTTACCG GTCGTGTAGA GCGCGGTATC 400 

6 0 AT C AAAGTAG GTGAAGAAGT TGAAATCGTT GGTATCAAAG AAAC CGCGAA 450 



50 
100 
150 



193 



CA 02283458 1999-09-28 



AACCACCTGT ACTGGCGTTG 
GTGCTGGTGA GAACGTAGGT 
ATCGAACGTG GTCAGGTACT 
CAAGTTCGAA TCTGAAGTGT 
ATACTCCGTT CTTCAAAGGC 
GACGTGACTG GCAC CAT CGA 
GGGCGACAAC ATCAAAATGG 
ACGACG 



AAATGTTCCG CAAACTGCTG 
GTTCTGCTGC GTGGTATCAA 
GGCTAAGCCG GGCACCATCA 
ACATCCTGTC CAAAGACGAA 
TACCGTCCGC AGTTCTACTT 
ACTGCCGGAA GGCGTAGAGA 
TTGTTACCCT GATCCACCCG 



GACGAAGGCC 


500 


AC GTGAAGAA 


550 


ACCCGCACAC 


600 


GGCGGCCGTC 


650 


CCGTACTACT 


700 


TGGTAATGCC 


750 


ATCGCGATGG 


800 




806 



2) INFORMATION FOR SEQ ID NO: 103 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 743 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoni 

(B) STRAIN: ATCC 13883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103 

GCAGACTCGT GAGCACATCC TGCTGGGTCG TCAGGTAGGC GTTC CGTACA 
TCATCGTGTT CCTGAACAAA TGCGACATGG TTGATGACGA AGAGCTGCTG 
GAACTGGTTG AGATGGAAGT TCGTGAACTG CTGTCTCAGT ACGATTTCCC 
GGGCGACGAC ACTCCGATCG TTCGTGGTTC TGCTCTGAAA GCGCTGGAAG 
GCGACGCAGA GTGGGAAGCG AAAATCATCG AACTGGCTGG CCACCTGGAT 
ACCTATATCC CGGAAC CAGA GCGTGCGATT GACAAGC CGT TCCTGCTGCC 
GATCGAAGAC GTATTCTCCA TCTCCGGTCG TGGTACCGTT GTTACCGGTC 
GTGTAGAGCG CGGTATCATC AAAGTAGGTG AAGAAGTTGA AATCGTTGGT 
ATCAAAGAAA CCGCGAAAAC CACCTGTACT GGCGTTGAAA TGTTCCGCAA 
ACTGCTGGAC GAAGGCCGTG CTGGTGAGAA CGTAGGTGTT CTGCTGCGTG 
GTATCAAACG TGAAGAAATC GAACGTGGTC AGGTACTGGC TAAGCCGGGC 
ACCATCAACC CGCACACCAA GTTCGAATCT GAAGTGTACA TCCTGTCCAA 
AGACGAAGGC GGCCGTCACA CTCCGTTCTT CAAAGGCTAC CGTCCGCAGT 
TCTACTTCCG TACTAC TGAC GTGACTGGCA CCATCGAACT GCCGGAAGGC 
GTAGAGATGG TAATGCCGGG CGACAACATC AAAATGGTTG TTA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
743 



2) INFORMATION FOR SEQ ID NO: 104 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 819 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. rhinoscleromatis 

(B) STRAIN: ATCC 13 884 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104 
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20 



TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG TGAGCACATC 
r-rrr-TOGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT TCCTQ.AAt.AA 
SgcSStg GTTGATGACG AAGAGCTGCT GGAACTGGTT GAGATGGAAG 
^SSratr? RCTGTCTCAG TACGATTTCC CGGGCGACGA CACCCCGATC 
SSSSwS cSSSgaa AGCGCTGGAA GGCGACGCAG AGTGGGAAGC 
^HS?t??t? GAACTGGCTG GCCACCTGGA tacctatatc CCGGAACCAG 
?^?2£t TotcAAGCCG TTCCTGCTGC CGATCGAAGA CGTATTCTCC 
iSSSSgJ OTGGTACCGT TGTTACCGGT CGTGTAGAGC GCGGTATCAT 
SISSSSt GAAGAAGTTG AAATCGTTGG TATCAAAGAA ACCGCGAAAA 
SxOTtSS ATGTTCCGCA AACTGCTGGA CGAAGGCCGT 
S^Stcaga ACGTAGGTGT TCTGCTGCGT GGTATCAAAC GTGAAGAAAT 

SSStgot caStactgg ctaagccggg caccatcaac ccgcacacca 

C ^^T°°p TrAAGTGTAC ATCCTGTCCA AAGACGAAGG CGGCCGTCAC 

aSc^S Sca^aggSa ccg?ccgcag ttctacttcc gtactactga 

^nln^r^r ACCATCGAAC TGCCGGAAGG CGTAGAGATG GTAATGCCGG 
SSESES GTTACCCTGA TCCATCCGAT CGCGATGGAC 
GACGGT CTGC GTTTCGCAA 

2) INFORMATION FOR SEQ ID NO: 105 



( i ) SEQUENCE CHARACTERISTICS : 
25 (A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera ascorbata 

(B) STRAIN: ATCC 33433 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105 
rrrrGCGATC CTGGTTGTTG CTGCGACTGA TGGCCCTATG CCACAGACTC 

cctgctgggt cgtcaggtag gcgttcctta catcatcgtg 
SSSSSS SgygacS ggSStgac gaagagctgc tggaactggt 

Z^I%r£*K rTTCGTGAAC TTCTGTCTCA GTACGATTTC CCAGGCGACG 

TCTGCTCTGA AAGCGCTGGA AGGCGATGCA 
rACTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 
JcSgaSc^a gaacgtgcta TCGATAAGCC GTTCCTGCTG CCAATCGAAG 
Stattctc CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 
cISStATCA TCAAAGTTGG YGAAGAAGTT GAAATCGTTG GTATCAAAGA 

ScSSaag tctacctgta ccggcgttga aatgttccgc aaactgctgg 
SgSggccg tgctggtgag aacgttggtg ttctgctgcg tggtatcaaa 
cg?gaagaaa tcgaacgtgg tcaggttctg gctaagccag gctctatcaa 
gcS^cacc aagttcgaat ctgaagtgta cattctgtcc aaagacgaag 

?rSScCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 
Stactactg ACGTCACCGG TACCATCGAA CTGCCAGAAG GCGTTGAGAT 
SKSgSS ggSacaaca TCAAGATGGT TGTGACTCTG ATCCACCCAA 
TCGCGATGGA CGACGGCCTG CGTTTCGCAA CC 



2) INFORMATION FOR SEQ ID NO: 106 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Jfluyvera cryocrescens 
10 (B) STRAIN: ATCC 33435 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 106 

TGGCGCGATC CTGGTTGTTG CTGCAACTGA TGGCCCTATG CCACAGACTC 50 

15 GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCTTA CATCATCGTG 100 

TTCCTGAACA AATGTGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCAGGCGACG 2 00 

ACACTCCTAT CGTTCGTGGT TCCGCGCTGA AAGCGCTGGA AGGCGACGCT 2 50 

GAGTGGGAAG CAAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT inn 

20 CCCAGAACCA GAGCGTGCGA TTGATAAGCC GTTCCTGCTG CCAATCGAAG Vkn 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400 

CGCGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTGG GTATCAAAGA 450 

CACTGCTAAG TCTACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 50 0 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 5 5n 

25 CGTGAAGAAA TCGAACGTGG TCAGGTTCTG GCTAAGCCAG GCTCCATCAA 600 

GCCGCACACC AAATTCGAAT CTGAAGTTTA CATCCTGTCC AAAGACGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG TACCATCGAA CTGCCAGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCAA Rnn 

3 0 TCGCGATGGA CGACGGTCTG CGTTTCGCAA J™ 



35 



2 ) INFORMATION FOR SEQ ID NO: 107 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
4 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Kluyvera georgiana 

(B) STRAIN: ATCC 51603 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107 

50 CGCGATCCTG GTTGTTGCTG CGACTGACGG CCCGATGCCG CAGACTCGTG 50 

AGCACATCCT GCTGGGTCGT CAGGTTGGCG TTCCGTACAT CATCGTGTTC ion 

GCGACATGGT TGATGACGAA GAGCTGCtSg AACTgSSS 150 

AATGGAAGTT CGTGAACTTC TGT CTCAGTA CGACTTCCCG GGCGACGACA Jnn 

cc ^ C S GATCGT TCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGACGCTgIS III 

55 TGGGAAGCGA AAATCATCGA ACTGGCGGGC TTCCTGGATT CTTACATCCC Inn 

GGAACCAGAG CGTGCGATTG ACAAGCCGTT OTGCTOcS AT CGAAGACG Iso 

^^ CCAT CTCC£ 3GTCGT GGTACCGTTG TTACCGGTCG TGTAGAACG? 400 

GGTATCATCA AAGTTGGCGA AGAAGTTGAA ATCGTTGGTA TCAAAGACAC tin 
c , GGCT AAGTCT ACCTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTgSSg 
60 AAGGCCGTGC TGGTGAGAAC GTTGGTGTTC TGCTGCGTGG TATCAAACGT 



500 
550 
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GAAGAAATCG AACGTGGTCA 
GCACACCAAG TTCGAATCTG 
GCCGTCATAC TCCGTTCTTC 
ACTACTGACG TGACTGGCAC 
5 AATGCCGGGC GACAACATCA 
CGAAGGACGA AGGTCTGCGT 



GGTACTGGCT AAGCCGGGTT 
AAGTGTACAT TCTGTCCAAA 
AAAGGCTACC GTCCGCAGTT 
CATCGAACTG CCGGAAGGCG 
AAATGGTTGT TACCCTGATC 
TTCGCA 



CTATCAAGCC 600 
GAC GAAGGCG 650 
CTACTTCCGT 700 
TTGAGATGGT 750 
CACCCGATCG 800 
826 



10 2) INFORMATION FOR SEQ ID NO: 108 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 
15 (c) STRANDEDNESS: Double 

(p) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: , 

20 K ' {A) ORGANISM: Lactobacillus casei subsp. casei 

(B) STRAIN: ATCC 393 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108 

25 PPTGCTGATG GCCCAATGCC ACAAACTCGT GAACATATCT TACTTTCACG 50 
?SSg??5ot GTTCCATACA TCGTTGTATT catgaacaaa tgtgacatgg 
TTGACGATGA AGAATTACTA GAATTAGTTG AAATGGAAAT TCGTGATCTA 
™c?gSat ATGAATTCCC TGGCGATGAC attcctgtaa tcaaaggttc 
APCTCTTAAA GCACTTCAAG GTGAAGCTGA CTGGGAAGCT AAAATTGACG 
JgSSSS AGCTGTAGAT TCTTACATTC CAACTCCAGA ACGTGATACT 
SSSaaccat TCATGATGCC AGTTGAGGAT GTATTCTCAA TCACTGGTCG 

?5g^agtt gcaactggac gtgttgaacg tggacaagtt aaagttggtg 

AGTTATCGGT A^AAJAAQ JTOCMMX ACTA™ 



100 
150 
200 

30 AGCTCTTAAA ™ = - — n^nTrrlal ACGTGATACT III 

350 
400 

„ iS^AG?23 AAATGTTCCG 9^5aCTA GATTACGCTG AAGCTGGCGA 500 
?2SS?S? GCACTTCTAC GTGGTGTTGC TCGTGAAGAT ATCCAACGTG 550 
CAACATTGGC- ^^^I.tV^I «.^n,mm^ nTnn*n*r*r TZVACTTCAAA 600 

650 

.o S?S?SraC ACTTCcXg^ GGTACTGAAA TGGT AATGC C TGGTGATAAC 750 
SSSSm CAGTTGANCT AATTGC AC C A ATCGCTATCG AAGACGGTAC 800 



45 



50 



55 



SSlSSSi Xgct^XcSX ggttcgatta ctccacacac taacttcaaa 

GCTGAAACTT ATGTTTTAAC TAAAGAAGAA GGTGGACGTC ACACTCCATT 

2??2aa^aac taccgcccac aattctattt ccgtactact gacgtaactg 

CllLAA^^ x , ^m^^r-TMviv ^T^TP-Pr TttOTGATAAC 



TAA 



2) INFORMATION FOR SEQ ID NO: 109 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 7 . . 

(A) ORGANISM: Lactococcus lactis subsp. lactis 

(B) STRAIN: ATCC 19435 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109 

197 
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CGGTGCAATC CTCGTTGTTG CTGCAACTGA TGGACCAATG CCACAAACTC 50 

GTGAACACAT CTTGCTTTCA CGTCAAGTTG GTGTTAAATA CCTTATCGTC 100 

TTCCTTAACA AGGCTGACCT TGTTGATGAT GAAGAATTGA TGGAACTCGT 150 

5 TGAAAT GGAA GTTCGTGACC TCTTGAGCGA ATACGACTTC CCAGGTGACG 200 

ATATTCCTGT AATCGCTGGT TCAGCACTTG GTGCTTTGAA CGGTGAACCA 2 50 

CAATGGG TTG CTAAAGTTGA AGAATTGATG GACATCGTTG ATGAATACAT 300 

CCCAACTCCA GAACGCGACA CTGACAAACC ACTCCTTCTT CCAGTCGAAG 3 50 

ACGTATTCTC TATCACTGGT CGTGGTACAG TTGCTTCAGG ACGTATCGAA 4 00 

10 CGTGGTACTG TTAAAGTTGG TGACGAAGTT GAAATCGTTG GTATCAAAGA 4 50 

AGAAAC TAAA AAAGCTGTTG TTACTGGTAT CGAAATGTTC CGTAAAACAC 50 0 

TTACTGAAGG TCTTGCTGGT GATAACGTCG GTGCACTTCT CCGTGGTATC 550 

CAACGTGACG AAATCGAACG TGGT CAAGTT ATTGCTAAAC CAGGTTCAAT 60 0 

CACTCCACAC AAACTTTTCG AAGGTGAAGT TTACGTATTG AGCAAAGAAG 65 0 

15 AAGGCGGACG TCACACTCCA TTCTTCGACA ACTACCGTCC TCAATTCTAC 700 

TTCCACACAA CTGACGTTAC TGGTTCAGTT AAACTTCCAG AAGGAAC TGA 750 

AATGGTAATG CCTGGTGACA ACGTGCATAT CGACGTTGAA TTGATCCACC 800 

CAGTTGCGAT CGAACAAGGT ACTAC 825 



20 



2) INFORMATION FOR SEQ ID NO: 110 



(i) SEQUENCE CHARACTERISTICS : 

2 5 (A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leclercia adecarboxylata 

(B) STRAIN: ATCC 23216 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110 

GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCAATGC CTCAGACCCG 5 0 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCTTTC ATCATCGTGT 100 

4 0 TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

GAGATGGAAG TTCGTGAACT YCTGTCCCAG TACGACTTCC CGGG CGACGA 2 00 

CACCCCAATC GTTCGTGGTT CTGCGCTGAA AGCGCTGGAA GGCGAAGCAG 2 50 

AGTGGGAAGA GAAAATCATC GARCTGGCTG GCTACCTGGA TTCCTACATC 3 00 

CCAGAGC GAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CTATCGAAGA 3 50 

4 5 CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 4 00 

GCGGTATCAT CAARGTTGGC GAAGAAGTTG AAATCGTTGG TATCAAGGAC 450 

ACTGCTAAGT CTACCTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCCGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTTCTGG CTAAGCCAGG CTCYATCAAG 600 

50 CCGCACACCA AGTTCGAATC TGAAGTGTAC ATCCTGTCYA AAGACGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

GTACKACTGA CGTGACCGGT ACCATCGARC TGCCAGAAGG CGTTGAGATG 750 

GTAATGC CAG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCAAT ft on 

CGCAATGGAC GATGGTCTGC GTTC JuU 
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2) INFORMATION FOR SEQ ID NO: 111 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 838 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: m 1ndadml 

(A) ORGANISM: Legionella micdadei 

(B) STRAIN : ATCC 33218 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111 
CGGAGCGATA TTAGTAGTAT CAGCAGCGGA TGGCCCAATG CCTCAAACGA 

SSSS SS 3KSSS GC = T S TAGT 
TGAAATGGAA GTACGCGAYT TGTTGAGCAG ^TGAATTT GATACG 
AGATCCCGAT TGTAGTTGGT TCAGCATTAA Ajggg cGATGGATTC 
AGTGATATAG GTGTACCAGC ™TTGM»M SSSSlTTC TTGTTACCGA 
TTATATACCT GAGCCGGTAA GAAACATCGA TAAft* AACAGGACGT 
TCGAAGACGT GTTCTCAATA TCTGGACGAG «AACA£J£^ ijupQTTGGTAT 
ATCGAAAGCG GGATC AT CAA AGTTGGTGAG GAAGTCGAGA ^ 
ACGTGACACT CAAAAGACGA CATGCACAGG CGTTGAAATG ^ 
TACTTGACGA AGGTCGAGCT GGAGACAACG JTGGTATATT £ 
ACGAAGCGGG ATGAAGTTGA ACGCGGACAA GTATTAGCTA 
CATTAAACCG GATACTAAAT TTGAAGCTGA J™J*J™ GCCTCAATTT 
ATGAAGGTGG ACGTCATACC CCATTCTTTA ^ CTGARGGTAT 

« « « ?™ " 
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2) INFORMATION FOR SEQ ID NO: 112 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 838 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Vi> ^r lN ^S? : Legionella pneumophila subsp. pneumophila 

(B) STRAIN: ATCC 33152 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112 

CGGAGCGATA CTGGTTGTAT CAGCAGCTGA TGGTCCTATG CCACAAACGA 
GGGAACACAT TCTATTGTCT gGCCAGGTAG GTGTTCCATA J^^. 
TTCATGAACA AAG CGGATAT GGTTGATGAu ^-^^ CCAGGGGATG 

sssss =s seek US sssss 
ssss; sees ssssa ssssss sssss 

SSSSSS 22E35 ESSES |~ ggggg 

S3SSS SSSSSSS ATTACGAGGT 

199 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 



CA 02283458 1999-09-28 



* 



~*^ GCGAG ATQ AAGTGGA GCGTGGACAG GTATTGGCGA AGCCAGGAAP 
CATCAAGCCA CACACCAAGT TTGAAGCAGA AGTGTATGTA TTAToSg5 
^S^ 00005 ACGTCACACT CCATTCTTTA ATGGATACCG TC^2S?2 
TATTTCAGAA CCACTGACGT GACAGGTACT TGTGACTTGC StoSSS? 
TGAAATGGTA ATGCCTGGAG ATAATGTGCA ATTAGTTGTT ipptt™ 750 
CTCCGATTGC GATGGATGAA GGTTTAAGAT TCGcS? AGCTTGCATG 800 



10 2) INFORMATION FOR SEQ ID NO: 113 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 8 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leminorella grimontii 

(B) STRAIN: ATCC 33999 
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2) INFORMATION FOR SEQ ID NO: 114 

( i ) SEQUENCE CHARACTERISTICS • 
(A) LENGTH: 826 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leminorella richardii 

(B) STRAIN: ATCC 33998 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114 

200 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113 

?n^^S T GGTAGT AGCA GCGACTGACG GCCCGATGCC TCAGACTCGP 
^™? CC TGCTGG GTCG TCAGGTAGGC GTTCCGTACA TCATcSS5 
CCTGAACAAG TGCGATATGG TTGATGACGA AGAGCTrrTn ninn^-ZI 
ARATGGAAGT TCGCGAACTG CTGTCTCAGT A^gStS? SSS^ 
ACTCCGGTAG TCCGCGGTTC AGCGCTGAAA GCGCTGgSo 200 

aaaatcatcg agctggca^S ac?t!?a?S III 

CAGAACCTGA GCGTGCGATT GACAAGCCGT TCCTGCTGCC iSSSJ^S 3 °° 

GTATTCTCTA TCTCCGGCCG TGGTACCGTT GTT^SSJc 5!£S5JS£ 350 

CGGCATCATC AAAGTCGGTG AAGAAGTGGA AATcSSS SSa*™™ 400 

CCACCAAGAC CACCTGTACC GGCGTTGAAA ^JSSSaa A ^ GAAAGAT A 450 

GAAGGCC GTG CGGGCGAGAA SSSSSJi S5£Sg£S SSSSSSS 5 °° 

SSSS3S SSggg IgSE ™ S ™ 

SHHS SSSS5 S s ssss SSKK So 0 
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GCTATCCTGG TTGTTGCTGC GACTGACGGC CCAATGCCTC AGACTCGTGA 
GCACATCCTG CTGGGTCGCC AGGTAGGCGT JCCTTAU ACTGGTAGAA 
TGAACAAGTG CGACATGGTT GATGATCAAG AGCTGCJ GCGACGATAC 

5 ATGGAAGTTC GTGAACTTCT ^TCAATAC GACT ^ 

GCCGGTTGTT CGCGGTTCAG CGCTGAAAO<- c ttayattcCA 

GGGAARCGAA AATCATTGAA CTGGCGGAAT gli TCGAAGACGT 
GAGCCAGAGC GTGCGATTGA CAA?CgTTC ^ GTAGAGCGCG 
TTTCTCTATC TCTGGCCGTG GTA^^GT ^CG^ caaaqACACC 

10 GCATCATCAA AGTTGGTGAA GAAGTGG^A TCG1 ^c^CGA 
ACCAAGACCA GCTGTACTGG CGTTGAAATG I ACTAAGCGTG 
AGGC CGTGC A GGTGAGAACG TTGGTGT1C1 £ CATCACTCCT 
ACGAAATCGA ACGTGGTCAG ^ACTGGCTA AGCC^GG^ ATGAAGGCGG 
CACACAGAAT TCGTGTCAGA AGTGTATATC CT^A^a gta 

15 YCGTCATACT CCGTTCTTCA AAGGCTACCG ^TCAGTTC agagatggta 

SS SESStt acgctgattg ccccaatcgc 
SItggaSgaa ggtctgcgct tcgcaa 



2) INFORMATION FOR SEQ ID NO: 115 

(i) SEQUENCE CHARACTERISTICS : 
( A ) LENGTH: 843 bases 
" (B ) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: Genomic DNA 

(B) STRAIN: ATCC 23581 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115 
TGCGGCGATT CTTGTAGTAT CCGCAACTGA CGGACCTATG CCACAAACAA 

Iagaacatat ccttcttgct cgtcaggtag gtgttccata tg gaaatgat 

TTCATTAACA AAGCAGATAT ^CTTGCTGCT GACGAAA^ TATAGCTTCC 
CGAAATGGTT GAGATGGACG TTCGTGAACT J AGCACTTGAG 
CAGGAGATAC AACTCCTATC GTTCATGGTT ^ TGATGGAAGC 
GGCGATGAAT CTGAAATTGG JATGCCTGCA A^CTCAAA aaaccTTTCC 
TCTGGATACT TTCGTTCCAA ATCCAAAACG TGTAMJ aactgttGCA 

45 ttatgccagt agaagacgtt ttctcgatca aagttgaaat 

> ACTGGAAGAG TGGAACAAGG TGTTTTGAAA tog T ATCGAAA tgt 

TATCGGTATC CGCCCAACAA CAAAAACTGT TGTTACCG^i 
TCAGAAAACT TCTCGATCAA GCGGAAGCTG GCGACAACAT C QCGAA 
CTTCGTGGAA CTAAAAAAGA AGAAATCGAA AGAGGGUAA gtgtatgtat 

ssss ss= fiiil sr— 
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"ESSE SS5SS iS= GTCTTAAGTT CGC 

2) INFORMATION FOR SEQ ID NO: 116 
( i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE : Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Megamonas hypermegale 
10 (B) STRAIN: ATCC 25560 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116 

,c CGG TGCTATC CTCGTTGTTA GTGCTGCTGA TGGTCCTATG CCTCAGACTC 

15 GTGAACACAT CCTTCTCGCT CGTCAGGTTG GTGTTCCAGC TATCGTTCTA i nn 

^T CAACA AAGCTGACCA GGTTGATGAC CCTGAACTTC SgSSJgT lln 

TGAAATGGAA GTTCGTGAAC TTCTTTCCAG CTATGACTTC cSSgcgI^G Inn 

ACGTTCCAGT AATCACTGGT TCCGCTCTTC AGGCTCTCGA AGGCgSSSS 

GAAGCTAAAA AGAAAATTCT TGAATTAATG GATGCTGTTG SS inn 

20 CCCAACTCCA ACACGTGACA CTGATAAACC TTTCTTAATG SS? 122 

ACGTATTCAC AATTACTGGT CGTGGTACTG TTGCTACAGG cSJg™? 

CGTGGCGAAC TTAAACTTGG TGACAGCGTT GAAATCOT?g 122 

SSJJ* 3 *** TCCACT ACTG TAACTGGTAT CGAAATGTTC tnn 

„ ^^ AGCGC TGTTGCTGGT GATAACATCG GTGCACTTCT S£s525K «2 

25 GACCGTAAAG AAATCGAACG TGGTCAAGTT CTTGCTAAAC CTGGCaSS S2 

TCATCCACAC AAAAAATTCA AAGCTCAGGT TTACGTATTA ActSJSJS «2 

AAGGTGGACG TCATACTCCA TTCTTCTCCA ACTATCGTCC aSot™ 

TTCCGTACTA CTGACGTTAC TGGTGTTGTA ACTCTTCCAT JSSSSI 00 

,n ^° GTTATG CCTG GCGATA ACmSSSS S5S5SSE ctSSSJ ln°n 
30 CAATCGCTAT TGAAAAAGGT CTTCGCTTCG CT CTCATCACTC 800 
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2) INFORMATION FOR SEQ ID NO: 117 

( i ) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 820 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A > ORGANISM: Mitsuokella multacida 

(B) STRAIN: ATCC 27723 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO- 117 

11 IS IIS SSI 

mmmmm - 
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,0 H^S^«^S™~ 500° 
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OMSOMO »«TCQM» J2f£GGTT SS 2S3SS» 

ss ^sss Slii sssss ssssss soo 

s SSSSS SSSSSS SS325 S£»iwo s . 

CGATCGCTAT CGAGAAGGGC 

L0 2 ) INFORMATION FOR SEQ ID NO: 118 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 831 bases 
in) TYPE: Nucleic acid 
C STRANDEDNESS : Double 
15 (D ) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 ^'^oSSMobilun^ curtisii subsp. holmesii 
( B ) STRAIN: ATCC 35242 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118 
25 CGGCGCTATC CTCGTGGTGG CXOgggOJ CGGTCCGATG ^AGACCA 
AGGAACACAT CCTGTTGGCT AAGCAGGTTG ^GAAGACA TGCTCGAAAT 
GCTCTGAACA AGTGCGATTC TTCCGATGTG c TXCGATC GTG 

STCGAGGAC GAAATCCGCG ATGACCTGGA gggM^ AGGCGACCCC 
30 ACTGCCCGAT TATCCACGTT JCCGCTCi^ CQ ATA CCTACAT 

GAGTGGACCA AGAAGATTGA AGAGCTCATG ^ cTTGATG GCTAT CGAAG 
TCCTGAGCCT GTTCGTGACC JOGACAABU- ^ TCGTGTGGAA 

ACGTCTTCAC CATTACTGGT CGCGGTACCG XA^ GTATTCGTCC 
CGCGGCAAGC TACCGTTGAA £GCCGAAGTG ujkh c ^tccatgg 

35 TACGCAAAAG AC CACCGTTA JCGCTATOGA Aft Q TGGCAC CAAG 
ACGAAGCCTA CGCCGGCGAG AACTGTGGTC ^ QCTCCGTGAC 
CGTGAGGACG TTGAGCGCGG TCAGGTTGTU ^ Q aagGACGAAG 

cScGCACACC AAGTTCGAGG ^CAAGGTCTA CATCTTQ^ 

gtggacgtca caagtcgttc tacgacggct a gaag GCACCGAAAT 
SSSSSS SgSStag cgttgagctg ATTCAGCCTA 
tcgctItgga ggaaggtctc ggcttcgcta t 
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2) INFORMATION FOR SEQ ID NO: 119 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 
50 ( C ) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

( B ) STRAIN: ATCC 3 5017 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119 
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2) INFORMATION FOR SEQ ID NO: 120 

(i) SEQUENCE CHARACTERISTICS- 
25 (A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Branhamella catarrhalis 

(B) STRAIN : ATCC 43628 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120 
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2) INFORMATION FOR SEQ ID NO: 121 
( i ) SEQUENCE CHARACTERISTICS : 

204 
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GGTGCAATTC TGGTTGTTGC TGCAACTGAT GGCCCTATrr pjp.p,^ 
IGAGCACATC CTGTTAGGTC GTCmSSS SSSSE A^S 
^CTGAACAA ATGTGACATG GTAGACGACG AAGAGCTGTT AgSctoSS 
GAAATGGAAG TCCGTGAGCT GCTGTCTCAG TACGATTTnr r^ZZZSS 

gSffi SSS55S S5SS5S S! 

sssss jggsssg sags sdis iiil >« 

GTGGTATCGT TAAAGTCGGT GAAGAAGTTG 5S£S£ 25SI25S 4 ° C 

ACCGTGAAAA CAACATGTAC TGGCGTTGAA ATCT^cSK 450 

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT tSgSgSt 5 °° 

ssssss ssss ss £ili IiE «»° 

sssss ssse ilil IIP "»° 

SS SS 3^ SSSJS ST K! 
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SSSSSS SSSSS 3K25S SEEK G CCACAAACTC *° 

TTCATGAACA AGTGCGATAT SESSSS SJ™ 301 * "0 

TGAAATGGAA GTTCGTGAAC TTCTATCTPa ™^ G £BS TAGAATTGGT 15 0 

ATACCCCAAT CATCAAAGGT tSgoEES *Inl»^ TC CCT <3GTGATG 2 00 

GGTAAATATG GCGAGCCTGC IotSSSS? JJSSJP^ TGG TTCTGAT 250 

CTATATCCCA GAGCC?GAG? SKmSSJ ^ CTAGACAG ^0 

TTGAAGATGT CTTCTCGATC TCAtAtShS? 0™™°*"° TT GATGCCAA 350 

GTTGAATCAG GTAtSSX IStJgCTgI? SiSSSS* GA ™GW»T 400 

CAAACCAACT GCTAAAACCA CCTGTACTcn SSJK^** TCA TCGGTAT 450 

tgttagacga aggt^SS gotSSS?? ^ CCGTAAAC 500 

ACTAAGCGTG AAGAAGTTCA ArrrrrrP^ GTGGTATCTT GTTGCGTGGT 550 

AATCACCCCA CATACtSot JSSSESf 23552?* AACCAGGTT C 60S 

AAGAAGGTGG TCGTCActcC CoStSES" AG ™J GTA CTGTCAAAAG 650 

TACTTCCGTA CCACAGATGT S£SSS£ iSSSESK CCCACA( 3TTC 700 

CGAAATGGTT ATGCCTGGTG ACAaSSS £ISS5£££ AAGAAGGTAC 750 

ACCCAATCGC CAGGATAAAG GTCtI2" GATGAGTGTT GAGCTTATCC 800 
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(A) LENGTH: 806 bases 

i-a) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D ) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ^ GIN SGS? : Morganella .orgranii subsp. morgana! 

( B ) STRAIN: ATCC 25830 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 121 
CGGCGCTATC CTGGTTGTTG CTGCAACTGA TGGCCCTATG CCACAGACCC 
15 GTGAGCACAT CCTGTTAGGT CGTCAGGTTG «J TGGAACTGGT 
15 TTCCTGAACA AATGTGACAT ^TGATGAT CCTGGCGA CG 
TGAAATGGAA GTTCGTGAAC TTJJGTCTCA £i CTGQA AG GCGAGCCA 
ACACGCCAAT CGTTCGCGGT TCAGCGCTGA AAGCA C ATTCTTACAT 
GAGTGGGAAG CTAARATCGT TGAACTGGCA GG Q CCAATCG AAG 

20 CCCTGAGCCA GAGCCTGCAA TTGACAAGCC TCGTGTT GAG 
ACGTATTCTC AATCTCCGGC CGTGGTACL.^ ii GTA TCAAAGA 
CGCGGTATCA TCAAGGTTGG TGAGGAAGTT ^CGT^ 
TACTGCGAAA AC C ACCTGT A CCGGTGTTGA AAl tqgTACCAAG 
ACGAAGGCCG TGCMGGTGAG AAJGTCGGTG li CCAG gttCAATCAA 
25 CGTGAAGAAA TCGAACGTGG JCAGOTTCTG ^TAja aaaGATGAAG 
ACCACAYACC AAATTTGAAT CAGAAGTTTA TATT ^ GTTCTACTT c 
GTGGTCGTCA TACTCCATTC TTCAAAGGYl Q GCGTTG AAAT 

SSESSS GGCGACAACA CGTCACCCTG ATCCACCCAfc 

3 0 TCGCAA 
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2) INFORMATION FOR SEQ ID NO: 122 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 825 bases 
(b) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

<vi ' S G "^S' wc—ewri- tuberculosis 

(B) STRAIN: TB 299 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122 < 
GGXGCGATCC TGGTGGTCGC CGXCACCGJC ^CCGATGC CCCAGACCCG 
CGAGCACGTT CTGCTGGCGC GTCAAGTGGG TGTGCCCT^ cGAACTCGTC 
CGCTGAACAA GGCCGACGCA GTGGACGACG ^ Q acgaqGACGC 

GAGATGGAGG TCCGCGAGCT GCTGOTGCC U£ QACGCGAAGT 
CCCGGTTGTG CGGGTCTCGG CGCTCAAGGC ^ ^ GTCG ATTCCG 

55 GGGTTGCCTC TGTCGAGGAA. CTGATGAACG ^ CQG TCGAGG ACGT 
QACCCGGTCC GCGAGACCGA CAAJCCGTTC CTGAl^ GTGGAGCGCG 
CTTCACCATT ACCGGCCGCG GAACCGTGGT JACCGGA^ TCGCCCATCG 

SStSatcaa cgtgaacgag Jaagttgaga JCGtcggcat agctgctcga 

60 SS5SSSS SS C G C G?GACA SEES? GCTGCTGCGG GGCGTCAAGC 
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GCGAGGACGT CGAGCGTGGC CAGGTTGTCA CCAAGrrrar ninnun™™ 
CCGCACACCG AGTTCGAAGG CCAGoS?2 AGGA^gS 
CGGCCGGCAC ACGCCGTTCT TCAACAACTA CCGTCCGCAG TTCTACTTrr 
GCACCACCGA CGTGACCGGT GTGGTGACAC TGCCGGAGTP rlrS^HS 
GTGATGCCCG GTGACAACAC CAACATCTCG OTgSg^S SSSJSS 
CGC CATGGAC GAAGGTCTGC GTTTC GAAGTTGA TCCAGCCCGT 



10 2) INFORMATION FOR SEQ ID NO: 123 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria cinerea 

(B) STRAIN: ATCC 14685 
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2) INFORMATION FOR SEQ ID NO: 124 

( i ) SEQUENCE CHARACTERISTICS • 
- n l A l LENGTH : 822 bases 

50 <B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria elongata subsp. elonaata 

(B) STRAIN: ATCC 25295 eiongata 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 124 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123 

SESSS 3KK5 SSKS c c ? G G S SSSSS - 

5SSSSS52 SSSSS 1I gSS SL ™ «°S 

ACTGCC CGAT CGTACAAGGT SSSSKS? JS^EF 0 CCAGGTGACG 200 

GCTTACGAAG AAAAAATC?? SSSSS AGGCGACG CA 250 

CCCAACACCT GAGCGT^G 5gg1cISS SSSSS iS2S?2J 300 

ACGTATTCTC TATTTCCGGT CGCGGTACAG mSSSSS ™~™ CGAAG 350 

CGCGGTATCA TCCACGTTGG TGACGACATP S?JJ£22*? TCGTGTAGAG 4 00 

AACTCAAAAA ACCaSSS S5££SK 2S5KSEK SSSSS^ 450 

ACGAAGGTCA AGCTGGTGAC AACGTAGGTG MtS5S£ ^S^ 00 5 <>° 

CGTGAAGACG TAGAGCGTGG TCAAGTATTG rSS^n TGGTACTAAA 550 

TCCTCACACC AAGTTCAAAG cS^gSa otJSJSSS SSSSJ?^ 600 

GTGGTCGTCA CACTCCGTTC TTCGCTAACT AcSSSSS t^^" 3 * 00 650 

CGTACTACCG ACGTAACCGG CGCGGTTaS £S2SSS A ES^ ACTTC 700 

GGTAATGCCG GGTGAGAACG T**S£2 SESKgg SSSSSSS JSS 
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ss ssss| i» ssgg S£££ 

5 TGAAATGGAA ATCCGTGACT TGCTGTCAAG ^CCTTGGA AGGCGACGCA 
ACTGCCCGAT CGTACAAGGT TCCGCACTGA AAGCCl ACAGCTACAT 
GCTTACGAAG AAAAAATCTT CGAACTGGCT ^GCATT^ qctATCGAAG 
CCCGACACCT GAGCGTGCCG TGGACAAACC to Q XCGTG TAGAG 

acgtattctc tatctccggc cgtggtacag GTCTGAAA ga 

10 CGCGGTATCA TCCACGTCGG TGACGAGATC ^ AAACTGCTGG 
AACCCAAAAA ACCACTTGTA CCGGTGTTGA AAlto CGGTACCAAA 
ACGAAGGTCA AGCAGGTGAC ^CGTAGGCG TATTGCTG^ ^ 
CGTGAAGAAG TGGAACGCGG TCAAGlAi i TGAQC aaaqaAGAGG 

TCCTCACACC AAATTCAAAG CAGAAGTTTA £GTA ATTCTACTTC 

15 GTGGTCGTCA TACTCCGTTC JTCGCTAACT ^ GTG TAGAAAT 
SSSgcS SS2SSS S?CAC TGTAGAACTG ATTGCACCTA 
TCGCTATGGA AGAAGGTCTG CG 
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2) INFORMATION FOR SEQ ID NO: 125 

( i ) SEQUENCE CHARACTERISTICS : 
,c (A) LENGTH: 820 bases 

Zb lB ) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



30 



(B) STRAIN: ATCC 13120 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125 

CGGCGCGACT TGGTATGTTC CGCAGCTGAC ? GTCCTATGC CGCAAACCCG 
CGAACACATC CTGTTGGCTC ^CAAGTAGG JGTACCTTAC 

40 TCATGAACAA ATGCGACATG GTAGACGATto U Q ^qqc^qGA 

GAAATGGAAA TTCGTGACTT GTTGTCAAGC GGTGATGCTG 
CTGCCCAATC ^ACAAGGTT CTGCACTGAA AGCTTT^ ^ 
CTTACGAAGA AAAAATCTTC GAATTtotoC TTCTTGTTGC CTATCGAAGA 
CCAACACCTG AGCGTGCTGT GGACAAACUl u CGCGTAGAGC 

45 CGTATTCTCT ATCTCTGGTC ^GTACAGT AGTAA TCTGAAAGAA 
GCGGTATCAT CCACGTTGGT ^j^AlUto JJ£ ^ aactocTGGA 

ACTCAAAAAA CCACTTGTAC CGGCGTTGAA ATGTic GG TACTAAAC 

cgaaggtcaa gcaggtgaca acgtaggcgt attgctgc taccatcact 

GTGAAGACGT AGAGCGTGGT ^™TTGG gAAA aagaAGAGGG 
50 SSSS ^™Sa CC^TCCACAA TggggC 

SSSSS SSSSSS S25SSS TTGCGCCAAT 
CGCTATGGAA GAAGTCTGCG 



55 
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2) INFORMATION FOR SEQ ID NO: 126 
( i ) SEQUENCE CHARACTERI STICS : 
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(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE : Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria gonorrhoeae 
10 (B) STRAIN: ATCC 49226 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126 

GGTGCAATCC TGGTATGTTC TGCTGCCGAC GGCCCTATGC CGCAAACCCr 

15 CGAACACATC CTGCTGGCCC GTCAAGTAGG CGTAC CTTAC ATCA^SSS ,nn 

TCATGAACAA ATGCGACATG GTCGACGATG CCGAGCTGTT JSS 

GAAATGGAAA TCCGCGACCT GCTGTCCAGC TACGACTTCC COGGaSS inn 

CTGCCCGATC GTACAAGGTT CCG CACTGAA AGCCTTOCAa 200 

9 n ™CGAAGA AAAAATCTTC GAaSSStI X££S££ 30O 

20 CCGACTCCCG AGCGTGCCGT GGACAAAC CA TTCCTGCTGC CTATOnSJ? «2 

cgtgttctcc atttccggcc gcggtaccgt agtcaccgg? St^tSSS Inn 

GAGGTATCAT CCACGTTGGT GACGAGATTG AAATCGTCGG SSsSSS? l.n 

ACCCAAAAAA CCACCTGTAC CGGCGTTGAA ATGTT^CgS AAC^gSa* tnn 

CGAAGGTCAG GCGGGCGACA ACGTAGGCGT ATTGCTGCGC GgSIpSI?S «2 

25 S55^°° T AGAACGC GOT CAGGTATTGG ?CAA^??GGG SKSSS? Ion 
CCTCACACCA AGTTCAAAGC AGAAGTGTAC GTATTGarrA aari^SnS« °° 

C ^ GCCAT ^CCGTTTT TCGcSSSS S££S£3£ 5??£Sg5 SS 
GTACCACTGA CGTAACCGGC GCGGTTACTT TGGAAAAAGT SSrSUSS °° 

GTAATGCCGG GTGAGAACGT AACCATTAPT A™™**^ JGTGGAAATG 750 

30 CGCTATGGAA GAAGGTCTG? GTAGAACTGA TTGCGCCTAT 800 



35 



2) INFORMATION FOR SEQ ID NO: 12 7 

( i ) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 CD) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 W ORGANISM: Neisseria lactamica 

(B) STRAIN: ATCC 23970 



50 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO- 127 

^mmmm 

ACTGCCCAAT CGTACAAGGT TCCGCAPTra CCAGGCGACG 200 

GCTTACGAAG AAAAAATCTT C^SSSc? SJSSSS^* AGGCGA TGCC 250 

cccgactccc gagcgtgSg SSSSSSS 300 

SSSSS SSSSE S5SE& Sii cS ~ G "J 

AACCCAAAAA ACCACcSSa SSSSSSS SSJSJSJS 450 

ACGAAGGTCA GGCAGGCGAC AACGTAgSS SSSSSS |SJ 



50 
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10 



15 



CGTGAAGAAG TGGAftCGCGG TCAGGT*™ SSS SagSSgG 
TCCGCACACC AAGTTCAAAG »^?T9i* IcCGTCCGCA ATTCTACTTC 

gcggtcgtca cactccgttc ttcgccaact accgt^ gcgtggaaat 
SgtgagaaS taaccStac tgtagaactg attgcgccta 
tcgctatgga agaagg 

2) INFORMATION FOR SEQ ID NO: 128 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
20 • (vi) ORIG^NAL^S^URCE meningitidis 
(B) STRAIN: ATCC 13077 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128 

25 cggtgcaatc ^gtatgtt ccgcagccga c^tcctatg ccgcaaaccc 

SUSSES S ™CGAT GCC^CTGT TG^CTGGT 
TGAAATGGAA ATCCGCGACC TGCTGTCCAG ^CCTTGGA AGGCGATGCC 
ACTGCCCGAT CGTACAAGGT TCCGCACTGA AAGCC T ACAGCTACAT 
G CTTACGAAG AAAAAATCTT CGAATTGGCl Q CCTATCG AAG 

CCCGACTCCC GAGCGTGCCG TGGACAAACC TT£ TCGTGTAGAG 
ACGTATTCTC TATTTCCGGT CGTGGTACAG QTCTGAAAGA 
CGCGGTATCA TCCACGTCGG TGACGAGATC AAACTGCTGG 
AACTCAAAAA ACCACTTGTA JCGGTGTTGA ^GTl CGGTAC CAAA 

ACGAAGGTCA AGCAGGCGAC ^JGTAGGCG TATTGUl GTACAATCAC 
CGTGAAGACG TAGAGCGTGG JCAAGTATTG GCTAAAC^ MGAAGAGG 
TCCTCACACC AAGTTCAAAG CAGAAGTATA JGTAC^^ ATTCTACTTC 
GCGGCCGCCA TACCCCGTTC TTCGCCAAUi Q qtGTGGAAAT 

SSAATGCCG CGTAGAACTG ATTGCGCCTA 

TCGCTATGGA AGAAGGTTTG CGCTTTGCGA T 



30 
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2) INFORMATION FOR SEQ ID NO: 129 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 815 bases 

(B) TYPE: Nucleic acid 

( c ) STRANDEDNES S : Doubl e 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria mucosa 

(B) STRAIN: ATCC 19696 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129 

209 
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CGGCGCAATC TTGGTATGTT CTGCTGCYGA CGGTCCTATG CCGCAAACCC 
GYGAACACAT CCTGTTGGCC CGTCAAGTAG GYGTACCTTA CATCATCGTG 
TTCATGAACA AATGCGACAT GGTTGACGAT GCCGAAYTGT TGGAACTGGT 
TGAAATGGAA ATCCGTGACT TGCTGTCAAG CTACGACTTC CCTGGYGACG 
ACTGCCCGAT TGTACAAGGT TCTGCACTGA AAGCCTTGGA AGGCGATGCC 
GCTTACGAAG AAAAAATCTT CGAACTGGCT GCCGCATTGG ACAGCTACAT 
CCCGACTCCC GAGCGTGCCG TAGACAAACC GTTCCTGTTG CCTATCGAAP 



50 
100 
150 
200 
250 
300 

" " - — ~ — i-.—i'v- w-i.^v, J. "or [ * I 1 M I f Nflfli ; o C n 

ACGTATTCTC CATCTCCGGT CGTGGTACAG TAGTAACCGG CCGTGTAGAG 400 

±U JSSS?™ TCCACGTTGG TGACGAGATC GAAATCGTAG GTCTGAAAGA 7sl 
" 50Q 

TCCGCACACC AAATTCAAAG CAGAAGTGTA CGTA^GAG? JSEEXSSS SJ 
15 GTGGTCGTCA TACTCCGTTC TTCGCTAACT ACCGTCCTCA ATTCTACTTC 
CGTACTACCG ACGTAACCGG TGCGGTTACT TTGGAAGAAG GTGTAGAAAT 

GGTTATGCCT GGTGAGAAYG TAGCCATYAC TGTAGAACTG ATTGCGCCTA 80 n 

TYGCTATGGA AGAAG 800 

815 

20 



AACCCAAAAA ACCACATGTA CCGGTGTTGA AATGTTCCGC AAACTGCTGG 
ACGAAGGTCA AGCCGGTGAC AACGTAGGCG TATTGCTGCG CGGTACCAAA 
CGTGAAGAAG TGGAACGCGG TCAAGTATTG GCTAAACCGG GTACCATCAC 
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2) INFORMATION FOR SEQ ID NO: 130 

( i ) SEQUENCE CHARACTERISTICS : 

2 5 (A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria sicca 

(B) STRAIN: ATCC 9913 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130 
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GGCGCAATCT TGGTATGTTC CGCTGCTGAC GGTCCTATGC CGCAAACPPP 
CGAACACATC CTGTTGGCCC GCCAAGTAGG CGTACCTTAC ATCATCGTGT 
4 0 T CATGAACAA ATGCGACATG GTTGACGATG CCGAGCTGTT gSJSgSt? 
GAAATGGAAA TCCGTGACTT GCTGTCAAGC TACGACTTCC CtSSSSaSSE 
S?aSSS SiJSSSS ^GCACTGAA AGCCTTGG^ SSSSgS 

SSSSSS sjssss sssss sssgss. 350 

45 25S35S "SZS2S. «£™ 2SS 

450 
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GCGGTGTTAT CCACGTTGGT GACGAGATTG £K5X£ SS 
ACCCAAAAAA CCACTTGTAC CGGTGTTGAA ATGTTCCGCA AACTOCtStJ 
SSJSJS* GCCGGTGACA ACGTAGGCGT AiSSSSSS " 
CA GTGAAGAAGT GGAACG CGGT CAAGTATTGG CTAAACCGGG TACCATCATT 
50 CCTCACACTA AATTCAAAGC AGAAGTTTAC GTATTGAGTA SSSJSS 

TGGTCGTCAT ACTCCGTTCT TCGCTAACTA CCGTCCTCAA TTCTACTTCC ' 
^ACTACCGA CGTAACCGGC GCGGTTACTT TGGAAGAAGG TGtS»AATO 
GTTATGCCTG GTGAGAACGT AGCCATCACT GTAGAACTGA TTGCACCGA? 
CGCTATGGAA GAAGGTCTGC GCTTTGCGA TGCACCGAT 800 

55 829 
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2 ) INFORMATION FOR SEQ ID NO: 131 
( i ) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(Vi) ORIGINAL SOURCE: o , f . „, 

(A) ORGANISM: Neisseria subflava 
1Q ( B ) STRAIN: ATCC 14221 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131 

cggcgcgact T ™ttc cocagctgat ggtcctaxgc ctcaaactcg 

GAAATGGAAA TCCGTGACCT GTTGTCAAGC GGTGACGCTG 
CTGCCCAATC GTACAAGGTT CTGCACT^AK £ CAGCTACATC 
GTTACGAAGA GAAAATCTTC GAATTGGCTG CTGCTCTeUA 
20 CCAACACCTG AGCGTGCTGT ^ACAAACCT JTCTTG CGTGTAGAGC 
CGTATTCTCT ATCTCTGGCC GTGGTACA^l aui TCTGAAAGAA 
GCGGTATCAT CCACGTTGGT GACGAGATCG ^gg,^ aacTGCTGGA 

ACCC ^i SSSSSSJ ATTGCTGCGT ggtaccaaac 

cgaaggtcaa gctggtgaca ctaaaccagg taccattact 

25 gtgaagacgt agagcgtggt Jaagtattgg ctaaacca^ 

CCTCACACCA AATTCAAAGC AGAAGTATAC GTA ^ TTCTACTTCC 
TGGTCGTCAC ACTCCATTCT JCGCTAACaA CGTAGAAATG 

SSSSS 3SSSS5 gKSactga ttgcgcctat 

3 0 CGCTATGGAA GAAG 
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2) INFORMATION FOR SEQ ID NO: 132 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

ii) MOLECULE TYPE: Genomic DNA 



(i 

(vi) ORIGINAL SOURCE: 
45 (A ) ORGANISM: Neisseria weavers 

( B ) STRAIN: ATCC 51223 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132 

" mSBBB 1 

55 ACGAAGAAAA ^ATCTTTGAA ^TAGCTGCTG CATTGGAT^ ttgaagatgt 25Q 
?£SS£SS GT^SaGT AACTGGTCGT GTA^GCGCG 400 
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GCGTGGTCAA GTATTGGCTA AGCCTGGTWC TATTACTCCG 
CAYACCAAAT TCAAAGCAGA RGTKTATGTW TTGAGYAAGG AArAArAr?? 

= ss ss dLii SE 

SSSSSSK aiT ™ SSSS 



10 2) INFORMATION FOR SEQ ID NO: 133 

( i ) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 836 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ochrobactrum anthropi 

(B) STRAIN: ATCC 49188 



2) INFORMATION FOR SEQ ID NO: 134 

(i) SEQUENCE CHARACTERISTICS • . 
(A) LENGTH: 805 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



60 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pantoea agglomerans 

(B) STRAIN: ATCC 2 7155 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134 

212 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133 

SSS2S3SS SS 3K£S SSS3E sggssgg ** 

TTCCTGAACA AGTGCGACCA GGTTGaISa? GCAgSSg? 10 ° 

TGAACTGGAA GTTCGCGAAC TTCTGTCGAA ATACGATTTC 150 

aagttccgat catcaagggc tcggctcttg ctgctctgS f^SSS 200 

AAGGAAC TGG GCGAAGACGC CGTTCGTTCG CTGATGGCCQ S^SSJS? 250 

CTACATTCCG ACCCCGGAAC GTCCGATCGA §3^S?tc 3 °° 

TCGAAGACGT TTTCTCGATC TCGGGCCGTG gSooSSS 350 

gttgagcgcg gtatcgtcaa ggttggtSK SJSSg? 1°,° 

caaggcgacg gcgaagacga cggtaaccgg cgttgaaatg ttcoSSSJ 

tgctcgayca gggccaggct ggcgacaaca tcggcgctS gI?c?S£3£ «2 

GTTGGC CGTG AAGACGTTGA ACGCGGCCAG GTTCTCTGCA iSAJSSSH 550 

TGTGAAGCCG CACACCAAGT TCAAGGCAGA AGcSaSS CTrirrasnn 6 °° 

ACGAAGGTGG CCGTCATACG CCGTTCTTTA CGAAC?S 650 

TACTTCCGCA CGACGGACGT GACCGGTGTT CTCaSotS SSSSS 7 °° 

ggaaatggtt atgcctggcg acaacgtcgc tatgS?g?c SSJS^S 750 

TGCCGATCGC CATGGAAGAG AAGCTCCGCT TOTCTA A CCCTGATCG 800 
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CCTGGTTGTT GCTGCGACTG ^GCCCAAT A c£cATCGT S«SS 
TCCTGCTGGG TCGTCAGGTT JGCGTTCCTT JCAl TAGAGATGGA 
AAGTGTGACA TGGTTGATGA JGAAGAGCTG CTGGAJ GACACC CCGA 
5 AGTACGTGAC CTGCTGTCAC AGTACGACTT LU-ft£ TGAGTGGGAA 
TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG ^GCGTT TCCCGGATCC 
GCAAAAATCG TTGAGCTGGC TGAACACCTG ^g^T 
AGTCCGTGCG ATCGACATGC CGTTCCTGCT GCCAA ^ qqqcgGCATC 
CAATCTCTGG CCGTGGTACC GTTGTTRL^ qqtaTCAAAG ATACTGCGAA 
10 GTTAAAGTCG GCGACGAAGT JGAAATCGTC G^J^ G ^ccaGGGTC 
ATCAACCTGT AC CGGTGTTG AGATGTTCCG TAAGC GCGTGAAGAC 
AGGCAGGCGA AAACTGTGGT GTTCTGCTGC ^ AGCCGCACAC 
ATCCAGCGTG GCCAGGTTCT JGCTAAGCCA ^ GGTGGCCGCC 
CCAGTTCGAG TCAGAAGTTT ACGTTCTGTC Jg£ T CCGTA CAACT 

15 ATACTCCGTT CTTCAAAGGC TATCGTCCAC £ TGGTCATGCC 
SSSSSS SSS3S tSSaCC^T GATCCACCCA ATCGCAATGG 



ACGAA 



20 



2) INFORMATION FOR SEQ ID NO: 135 

( i ) SEQUENCE CHARACTERISTICS : 
-c (A) LENGTH : 825 bases 

" (B ) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

f-vi ) ORIGINAL SOURCE: 

(A) ORGANISM: Pantoea dispersa 

( B ) STRAIN: ATCC 14589 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135 

CGCGATCCTG GTTGTTGCTG CGACTGATGG CCCAATGCCT CAGACCCGTG 
AGCACATCCT GCTGGGCCGT CAGGTTGGCG JTCCTa^ ^ crQGTTGA 
CTGAACAAGT GTGACATGGT TGATGAUUAa ^ GGCGACGATA 
GATGGAAGTT CGCGATCTGC TGTCTCAGTA gACTT^ CGACGCTGAG 
CCCCAATCGT ACGCGGTTCT GCGCTGAAAG CGCg^ TA CTTAC ATTCC 
TGGGAAGCGA AAGTCGTTGA JCTGGCTGGT ATCGAAGACG 
AGATCCAGTA CGTGCTATCG ATCTGCCGTT CCTGCTGC^ TGTTGAGCGC 
45 TATTCT CAAT CTCTGGCCGT ^TACCGTOG ^ TCAAAGCGAC 
GGCATCGTGA AAGTGGGCGA CGAAGTAGAA ATCGTT CTGCTGGACC 
TGCCAAGTCT ACCTGTACCG GTGTTGAAAT ^ TATCAAGCGT 
AGGGTCAGGC AGGCGAGAAC TGTGGTGTTC CCATCAAGCC 
GAAGAGATCC AGCGTGGTCA ^TCTGGCT ^^TAAA G AC G AAGGC G 
50 ACACACCAAG TTCGTATCAG AAGTGTACGT ACTGTCa^ CTTCCGX 
GCCGTCATAC TCCGTTCTTC AAAGGCTACC ^g^cG' TTGAGATGGT 
SSSffi OACAACATCH 2S3SSS TGAGCTGATC CACCCAATCG 
CGATGGACCA GGGTCTGCGT TTCGC 
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2) INFORMATION FOR SEQ ID NO: 136 
( i ) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 762 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Pasteurella multocida 
10 (B) STRAIN: NCTC 10322 



15 



20 



25 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO- 136 

CAGGTGATGA TACACCAATC GTACGTGGTT ra™^ TATGATTTCC 15 0 

GGYGTAGCTG AGTGGGAAGA S?SSJ?S AGCG TTAAAC 200 

TACTTACATT CCAGAGCCAC AACGTGCAAT rrzJZ**^ ACCACTTAGA 250 

cgattgaaga cgtgttctca atttctSJ? SJSJ^SS 6 ttc cttcttc 300 

CGTGTTGAGC GTGGTATCAT SS rllnlnn^l AGTAACAGGT 350 

tattaaagcg acaacgaaS SSSSS2E ^?^IIS g 400 

ssssss sis? iSi ™- is 

sssss sssss Si iEE sssss 6 5 o 5 o° 

sssss ggsss Si skks E G ™ - 

TTCACCCAAT CG TAACAT ^GATGACT GTAAGTTTGA 750 



2) INFORMATION FOR SEQ ID NO: 13 7 

35 (i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE * 

45 £} SS™" 8 ^streptococcus a^aerobius 

q * (B) STRAIN: ATCC 27337 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137 

SSSSSS SSS SS2 EES™ c ™caa 

tatttgaata aagcJSJat ggtS^ga*? ^ atcgtagt a ioo 

agaaatggaa gtaagagaat tactatcSI f^SS^™ tagaattagt 150 

AAATTCCAAT CATAACAGGA ToSSSSS >SSSSS CCAGGAGATG 200 

AAATGGATAG ATCAAATCAT OQCaSSto mrlT TGGAGAACAA 250 

TCCAACACCG GAAAGAGCAG TAGA^SS??. ^™ TAG AT( 3AATATAT 3 00 

ACGTATTTAC AATTACAGGA SSS2?S vln^^™ C CAATCGAAG 350 

AGAGGAGTTG TAAAAGTWGG IXSSotS ^^TGAA 400 

GACAACAAAG ACAACTTGTA CYGGAGTarJ GAA ^™S TAG Q AATCAAAGC 450 

ATCAAGGACA AGCAGGAGAT SS5S£S JJJSJSS? 1 ^^TTGG 500 

AAAGAAGATG TAGAAAGAGG SS SSSSSSS GAACAATTCA g 0 ° 
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.cct™ gcrrggg ss sss ississ 

GAGGAAGACA TACTCCATTC ™CAGGAT * GAGTAGAAAT 
S3SSS« SSSSSS 5SS35SS AGTAGAATTG ATTCACCCAA 
5 TTGCAATGGA AACAGGATTA CGATTTGCAA TT 

2) INFORMATION FOR SEQ ID NO: 138 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 ( D ) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (Vi) ^r^cS: 1 Peptostreptococcus asaccharolyticus 

^ ( B ) STRAIN: LSPQ 2639 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138 
TAGTATGTTC AGCAGCAGAY GGTCCAATGC CACAAACAAG AGAACACATT 
CTACTAGCAA GACAAGTTGG TGTACCAAM^ TGAATTAGTA GAGATGGAAA 
AGAAGACCAA JTAGACGATC CAGAACTAAT JGAATTAGTA 
TCAGAGACCT ACTATCAGAA TATGACTTC^ ^ Q QAGAATGGGG 

GTAGTAGGAT CAGCATTAAA AGCCCTAGAC ATCCCAACAC 
AGACAAAATC GTAAAACTAA TGGAAGMAGT AGACGAATAC 
CAGTAAGAGA TACAGAACAC CCATTCCTAA Q aacAAGGTGT 

TCAATYACAG GAAGAGGAAC AGTAGCAACA btaftH GACGAAAGCA 

agtaaaagta ggmgacacag tagaactagt aggcttaa cta 

GACAAGTAGT AGTAACAGGT GTAGAAATGT JTAGAAAA -pACAAAGAGA 
GCAGAAGCMG GAGACAACAT TGGAGCCCTA CTAAGAGG^ atcaaaccac 
AGAAATCCAA AGAGGACAAG TACTAGCAGC ™£^MW3h AGAAGGTGGA 
ACACAAAATT TGAAGCAGAA. GTATACGTAC JAAUA^ aCTTCAGAAC 
AGACACACAC CATTCTTTAA CGGATACAGA CCACAATTCT 

££££££ SSKtS tttSa CACTAATCAC ACCAATCGCA 
ATGGACGAAG GACTAAGATT CGC 
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2) INFORMATION FOR SEQ ID NO: 13 9 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
( d ) TOPOLOGY : L ine ar 



(ii) MOLECULE TYPE: Genomic DNA 

55 lvl)t S aa SJSSS',' Peptostrepeococcus prevccii 
(B) STRAIN: ATCC 9321 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139 
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CTATCATCGT AGTATCTGCA GCAGACGGTC CAATGCCCAA ACAGAC APA a 

SSSSZSF TAGCAAGACA AGTAGGCGTT CCAAAAATCG i «J 

AAACAAAGAA GACCAAGTAG ACGATC CAGA ACTAATCGAA TTiPTira&» 100 

TGGAAATCAG AGAC CTACTT TCAGAATACG ACTTCGAtSS 150 

CCAGTAGTAG TAGGATCTGC TCTTAAATCA cSrSJIS ^oS^ 00 " 200 

ssss sssss ssai IS IS1 >"° 

sssss esses sssss =i iEi " s 

= sssa sssss si isEi ss 

AACAAGAGAT CAAATCTCAA GAGGACAAGT SSr SJ 

TAAACCCACA CACAGAATTC GAAGGTCAAG TATACGTACT S^KS?? 600 

GAAGGTGGAC GTCACACACC ATTCTTCATT rriT^n^I AACAAAAGAA 650 

CTTTAGAACA ACAGACotS SSSSSS SSE£SK£S 7 °° 

AAATGGTAAT GC CAGGAGAC AACGCAACAT TCAaStcA^ ?5 ° 

CCAATCGCTC TAGAAGAAGG ACTAAGATTC ^Jf^ 70 * 0 ACTCCAAAAA 800 



15 
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2) INFORMATION FOR SKQ ID NO: 140 

(i ) SEQUENCE CHARACTERISTICS • 
(A) LENGTH: 831 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas asaccharolytica 

(B) STRAIN: ATCC 25260 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140 

^^GCTATC ATCGTAGTTG CTGCAACTGA TGGTCCTATG CCTCAPArrr 

ACACTCCTGT CATCCGTGG? SSSSSE SSSSS 

AAGTGGGTAG AGAAGGTTAT GGAGCTCATG GAGGCTGTAP 250 

CCCACTACCT GAGCGCGACA TCGACAAGCP ttt^pS AGACTTGGAT 3 0° 

ACGTATTCTC TATCACAgS SSSKSS 350 

ACTGGTGTCG TTAAGGTCAA CGATGAgSt 400 

TGAGGGTAAG AAGAGCGTCG TAACTGGCOT rr^pl™ GTCTAGGTGC 450 

S3S2SK ssss £?li lEE IIP 1 - - 

sssgss ssi SSI 1™ sssssss ; 5 °s 

CAGTAGCTTG SSSSgg otSJSS 2 GATCTCM<3 GTCATCTCTC 80 ° 

on 



40 



45 



55 



2) INFORMATION FOR SEQ ID NO: 141 

(i) SEQUENCE CHARACTERISTICS- 
60 (A) LENGTH: 818 bases 



50 



831 



216 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



5 (ii) MOLECULE TYPE: Genomic DNA 

{vi ) ORIGINAL^URCE : gingivaIis 
(B) STRAIN: ATCC 33277 

L ° (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141 

CGGTGCTATA ATCGTTGTAG CAGCTACAGA CGGTCCTATG CCTCAGACTC 
GCGAGCA.CAT CCTTTTGGCT CGCCAGGTAA JCG^^ rpcGAGCTTGT 

15 TTCATGAACA AATGTGACAT GGJ^^ CTACGATTTC GATGGTGACA 
TGAAATGGAC ATGCGCGAAC TCCTTTCTJl qcGCTTTGAA TGGAGAGCCT 
ATACCCCTAT CATCCGTGGT TCTGCTCTGG ^11 ACAACTGGGT 
CAGTGGGAAG ACAAGGTGAT GGAGCTTATG ccGGTTGAAG 
TCCCCTGCCT GAGCGCGATA TCGACAAACC f ACGTATCGAA 

20 ACGTGTTCTC TATCACGGGT «3TOOTACQB ^ GCCTCGGTGC 
ACCGGTATTG TGAAGACCGG TGACGAAGTT ^^ TQTTC CGTAAGAT TC 
AGAAGGAATG AAGTCGGTTG TTAL^i^i GTCTCCTCCT QCGTGGTATC 
TTGACGAAGG TCAGGCTGGT GACAACGTTG JTCTCCTCC1 T 
GATAAGGATC JGATCAAGCG TGGTATGGTT ATCTCTCACC HfflHMfl 

25 TACTCCTCAC AAGAGATTTA £f^£^ AATATCGTCC GCAGTTCTAC 
AAGGTGGTCG CCACACTCCT TTCCACAACA J£™ AAGGAACAGA 

SSSSS ACGTAACGAT CACTGTAGAA CTCATCTACC 

CGGTTGCATG TAATGTAG 



2) INFORMATION FOR SEQ ID NO: 142 

-i c (i) SEQUENCE CHARACTERISTICS: 

" (A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



40 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pragia fontium 
45 ( B ) STRAIN: ATCC 49100 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142 

50 
100 

50 GTGAGCACAT CCTGTTA^x ££££££ — SHSScTOT TAGAACTGGT 150 

200 
250 
300 
350 

55 TCCACAGCCA GAGCGT^ — ^tIIcCGG TCGTGTAGAG 400 



60 



sill iis sssss 

lis ssss s« 

mmmmm 



450 
500 
550 
600 



CA 02283458 1999-09-28 



650 



^™^ TACT AACTTCGTAT CAGAAGTTTA TATCCTGAGC AAAGATGAAP 
GTGGTCGTCA TACTCCATTC TTCAAAGGCT ACCGTCCACA GTTYTACTTC 
CGTACAACTG ACGTGACCGG TACCATCGAA CTGCCAGAAG GCCTArSS 32° 
GGTAATGCCA GGTGATAACA TTCAGATGAC TGTAACTCTG 750 
TCGCGATGGA CGAAGGTTTA CGCTTCGCTA ^ iAACTCTG ATTGCCCCAA 800 



35 



40 



45 2) INFORMATION FOR SEQ ID NO: 144 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 
50 CO STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Prevotella oralis 



60 



(B) STRAIN: ATCC 33269 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144 



830 



2 ) INFORMATION FOR SEQ ID NO: 14 3 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Prevotella melaninogenica 

(B) STRAIN: ATCC 25845 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143 

25 TGCTGCTATC TTGGTTGTAG CTGCTACTGA TGGTCCTATG cCTCAP&nr 

SSSS 2SSSSS SSSJSS sll Eill "o 

3 0 TGGGTAGACT CTGTAATGGA GCTCATGGAT SSSSSS SSSKS? 25 ° 
AGAGCCAGAG CGTGAGATTG ACAAGCCATT CTTGATGCCT 
TATTCTCTAT CACAGGTCGT GGTACTGTAG CTACTGGTCG 
™^ TGTA AGGTAGGTGA TGAGG^AG SSSSSS? 
CAAGAAGTCT GTTATCACTG GTGTTGAGAT GTTCCrTAar 

CAGGTCAGGC TGGTGACAAC GTAGGTCTCC JcC^TCGTCG 5 °° 

GCTGAGGTTA AGCGTGGTAT GGTTGTTGTG CA^CcSSS SISS^S 550 

TCACGATCAC TTCAAGGCAT CTATCTATGT SJSnSKS ^ A ^ ACTCC 600 

GTCGTCATAC TCCATTCGGT AACAAGTATC rJSS?™ ? VAGAGGGTG S50 

ACAATGGACT GTACAGGTGA AATCCACCTC cSSSS£ S£ AC ^° CGT 700 

TATGCCAGGT GACAACGTAG iS5S£SS £3Sg££? ?5 ° 

CTTTGAACGA GGGTCTTCGT T ATTGATC TATAAGGTTG 800 



50 



300 
350 
400 
450 



821 



218 



CA 022834S8 1999-09-28 



■ > » • 

TGGTGCTATT CTTGTAGTAG CT^TGA CGGTCCTATG CCTCA^CTC 

GTGAACACGT GCTTCTTGCT CGTCAGGTGA XTGAGCTCGT 
TTCTTGAACA AGTGCGATAT GGTTGACGAT UAft GAGGAGGATA 

AGAAATGGAG CTTCATGAAC ™TCGM3CA Gl AGTAGAGAAG 

5 CTCCTATTGT TCGTGGTTCG ^TTGGCG ^ Q aatqgaTACA 

TGGGTTGACA GCGTGATGAA GTTGATGGAT JCC^ GTAGAGGATG 
GGAACCACCG CGTGATCTTG ATAAGCCTTT CTTGAg TATTGAAACT 

TATTTTCTAT TACTGGTCGT GGAACGGTTG JTACAG TCGGTGAAGA 

GGTAAGGTTA AGGTGGGCGA TGAAGTTCAA CTTCT ATTCTTGACG 

10 TAAGAAGTCC GTTGTGACAG GCGTTGAGAT ^ TATCGATAAG 

AAGGTGAAGC TGGTGATAAT GTAGGCTTGC ^ CTATTACTCC 
ACGGAAGTAA AGCGTGGTAT GGTTGTCGTA CA^ ^ qaagaaGGCG 

sjsss ssss sssgg ?SS SSSS 

15 SSS JSES3S taagttgatc tatccggtag 

OTTGAACGA GGGACTTCGT TTCGCTA 

20 2 ) INFORMATION FOR SEQ ID NO: 145 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(B) STRAIN: ATCC 6919 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145 

CGGCGCCATC CTCGTGGTTG CTGCTACCGA C^CCCGATG CCTCAGACTC 
GCGAGCACGT TCTGCTCGCT ^GTCAGGT^ ^AGGAGCTCA TTGAGCTCGT 
GCCCTCAACA AGTGCGACAT GgTTGACGAT GAJGAG GACGGCGACA 
4 0 CGAGATGGAG GTCCGCGAGC TGCTGACC1U uuj GGGTGATGAG 
ACTGCCCTGT CGTTCGCATC TCCGCCTTCC ^CCCT^ua TACAT 
AAGTGGACCC AGTCGATCCT CGACCTCATG ^GCCGa ccGATCGAGG 
CCCGCAGCCT GAGCGCGATC JCGACAAGCC gTCCTTAT^ 
ACGTCTTCAC CATCACCGGC ^GTGGCAC^ i±£ GT ATC C ACGA 
45 CGCGGCGTCG TC^GACTGG g^GAGGTC gggOTC^ CGCAAGATCC 
GAAGACCCAG AAGACCACCG ™CCGOi^i u£ CCGTGGCACC 
TCGACGAGGG CCGCGCTGGT GAGAACGTCG gGTTCTG^ CTGGXXCC AC 
AAGAAGGAGG ATGTCGTTCG CGGCATGGTC gCTCCAJ j^ GMiGGIi TG 
CACCCCCCAC ACCGACTTCG AGGGCCAGGT CTACGTCC cCAGTTCTAC 
50 AGGGTGGCCG CCACAAGCCG JTCTTCTCCC ^ AGGGCACCGA 
SS SSSSSSS ISSSS? SpSTSCC CTGATTCACC 

2S5??SccSt ggaggatcag ctcaagttcg cta 



25 



30 



55 



60 



2) INFORMATION FOR SEQ ID NO: 146 

{ i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 745 bases 



50 
100 
150 
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(B) TYPE: Nucleic acid 
C) STRANDEDNESS : Double 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: ATCC 35659 



25 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 S 

SSSSS SS33E ^ CAGGTOGG »™«™c 

AGAATTAGTT GAAATGGAAG SSSSSS W^CTGTT 10 0 

CAGGTGATGA CACTCCAGTA ATCcStSS? ^n^Z ^ TACGATTTCC 150 

GGCGAAGCAG AGTGGGAAGC JSSJ?2S £X2£JS* AGCACTGGAA 200 

TTCTTATATC CCAGAGCCAG j£S3i£55 JJSSF™ 250 

CAATCGAAGA TGTATTCTCA ATrTPArrrP ^S^ 0 ^ TTCCTGTTAC 300 

CGTGTAGAGC GTGGTATCAT C^SSJSr? SSI^*" AG TTACTGGT iso 

TATCAAAGAA ACCGCCAAAA ??^^2^ G AGATTGTTGG 400 

AATTACTTGA CGAAGGTCGT SaSSSS* ;f GGC -GTTGAA ATGTTCCGTA 4 50 

GGTACAAAAC S£K£S£ SSSSSSZ TCTGC TGCGT 5o2 

CTCAATCAAC CCACACAACA AAmsSS? S^™^ C RAAACCAGG 550 

AAGATGAAGG TGGTCGTCAC JSSSSJS JS^SSI ATTCTGAGGA 600 

TTCTACTTCC GTACAACTGA CGTaSSS ISSJSSS GCG J CCACAG 650 

CGTAGAAATG GTAATGCCAG OCg52SSJ S5£5SS? gS^ 7 °° 



2 ) INFORMATION FOR SEQ ID NO: 147 

( i ) SEQUENCE CHARACTERISTICS • 
(A) LENGTH: 829 bases 
35 JB) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



45 
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(ii) MOLECULE TYPE : Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus penneri 

(B) STRAIN: ATCC 33519 



50 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147 

SSSSgg SSSS ESS3£S Kg™" aauuicicQ 

TCCTGAACAA ATGTGACATG otSSSSg 12322^ ATCATCG TAT ioo 

GAAATGGAAG TTCGTGAACT TCTOtSKS JSXS*" GGA ATTAGTM 150 

CACTCCAGTA ATCCGTGGTT CAgSSS*? * ACGATTTCC CAGOTGATGA 200 

AGTGGGAAGC AAAAATtStT iSS25S£ GGCGAAGGAG 250 

CCAGARCCAG AGCGTGCAAT T^rA^aS? 0 AAGCACTGG A TTCATACATC 3 00 

CGTATTCTCA ATtSSK?? OTtoSSJ 2K22S a*™*** 3sS 

GTGGCGTAAT CAAAGTTGGT gSSoSctS J^™! CGTGTTGAGC 400 

ACAGCGAAAA CAACTTGTAC TGGCPTTna? A ^ A ^ CGTTGG TATTAAACCA 4 50 

CGAAGGTCGT GCAGGTGAGA jEoSSSS A ^ CCGTA ^TTACTTGA 500 

GTGAAGAAAT CGAACGTGGA cSgtSS £5125^ GG< ^ACTAAAC 550 

CCACACACTA AATTTGAATC AGAAGtJt?? CGAAAGCAGG T TCAATCAAC 60 0 

TGGTCGTCAT ACTCCATtS J32SSS SSSSSS SSSgJgg «° 
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sssa sssffi « sssss 

CGCGATGGAC GACGGTTTAC GTTTCGCTA 

5 

2) INFORMATION FOR SEQ ID NO: 14 8 

(i) SEQUENCE CHARACTERISTICS: 
lft (a) LENGTH: 824 bases 

10 m( TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

^^cX'- Proteus vul^ris 
(B) STRAIN: ATCC 13315 
20 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148 

CGGAGCTATT CTGGTTGTTG CTGCGACTGA TGGCCCAATG CCACAAACTC 
GTGAGCACAT CCTGTTAGGT CGCCAGGTTG ^ TGGAATTAGT 
25 TTCCTGAACA AATGTGACAT GGTTGATGAi CCAGGTGATG 
AGAAATGGAA GTTCGTGAAC TTCTGTCTCA ^CGAT AGGCGAAGCT 
ACACTCCAGT AATCCGTGGT TCAGCGCTGA ATTCTTACA t 
GAGTGGGAAG CAAAAATTGT TtAA^^ ATTCCTGCTG CCTATCGAAG 
CCCAGAACCA GAGCGTGCAA TTGACAAACC ATTCCTGC^ CCGTGTAGAG 
3 0 ACGTATTCTC AATCTCTGGT CGTGGTACAG WV^JL GTATTAAAGA 
CGTGGTGTTG TTAAAGTTGG TGAAGAAGTT ^tacttG 
CACAGTTAAA ACAACTTGTA CTGGCGTTGA AATG1 TGGTACTAAA 
ACGAAGGTCG TGCAGGTGAG JACGTAGGTG T ^ GTTC AATCAA 

CGTGAAGAAA TCGAACGTGG ACAAGlAt-i^ TGAQC aaagATGAAG 

35 gccacacact aaattcgaat cagJJJJTTA Jatcctga^ tacttc 
gtggtcgtca cactccattc ^caaaggtt accgtc gcgtagaaat 
SSSEX 2SKE2X SESS3S cgttgaactg attcacccta 
tScgtagga cgacggttta cgtt 



40 



750 
800 
829 



2) INFORMATION FOR SEQ ID NO: 149 

a c (i ) SEQUENCE CHARACTERISTICS: 

45 ( A ) LENGTH: 745 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

<Vi,< ?S GI ^?IS?"i.rovide„ci. alc.lif.cien. 
55 ( B ) STRAIN: ATCC 9886 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149 

60 SSSS ggSSS SSSSSSS 32S3S 

221 
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10 



15 



^^^ TAGTT GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGATTTrr 

CAGGCGATGA CACTCCAGTT GTTCGCGGTT CAGCACtSS AGCGCTGgS ™2 

GGCAACCCAG AGTGGGAAGC AAAAATTGTT GAATTAGCAG ctS£5SS? 2 °° 

TTCTTACATC CCAGAACCAG AGCGTGCAAT TGACAAG?c5 SJJSSXS 250 

CAATCGAAGA CGTATTCTCA ATCTCTGGTC GTG^ACT SSSrS^ 3 °° 

CGTGTTGAGC GTGGTATCAT CAAAGTTGGT GAAGAAOT?? mISSSS^ 350 

ISS^ G ACTG ^AAAA CAAC??G?A? SSSSJ52 122 

^"GCTGGA TGAAGGTCGT GCGGGTGAGA ACGTTGGTGT TCTOCToSJ JnS 

GGTACTAAAC GTGAAGAAAT TCAACGTGGT CAAGTACTGG rrS^J 500 

TTCAATCAAG CCACACACTC AATTCGAATC AGA^otSa? SJSSS 550 

AAGATGAAGG TGGTCGTCAT ACTCCATTCT Tm^I ATTCTGAGCA 600 

TTCTACTTCC GTACAAcJgJ CGTAACCGGT ISSSSS SSSK^ 65 ° 

CGTAGAGATG GTAATGCCAG GCGACAACAT ^SSSS g?SaC ^ ?J° 



30 



35 



40 



45 
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2) INFORMATION FOR SEQ ID NO : 151 

( i ) SEQUENCE CHARACTERISTICS • 
(A) LENGTH: 826 bases 
(B TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



2) INFORMATION FOR SEQ ID NO : 150 

(i) SEQUENCE CHARACTERISTICS- 
20 (A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia rettgeri 

(B) STRAIN: ATCC 9250 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO- 150 

TGAAATGGAA GTTCGTGAAC TTCTGtSS SSSfS^ ^AGAATTAGT 150 

ACACTCCAGT TGTCCGTGGT TCAGCtSSJ fSSSEE? CCAGGC GACG 200 

GAGTGGGAAG CGAAAATTGT TGAMtSS J^™^ A GGCAACCCA 250 

CCCAGAACCA GAGCGTGCAA TTGACAAACC A "CTTACAT 300 

ACGTATTCTC AATCTCTGGT CGTrrTara? ^° C ?2 CTG CC AATCGAAG 350 

CGTGGTATCA TCAAaStGG 5gAa1SS? SXSSSSS 6 CCGTG TTGAG 400 

CACGGTTAAA ACAACTTG^A SS3SJ5S 253552 SSJSSS* 3 * 450 

ACGAAGGTCG TGCGGGTGAG AACCTTrr^ AAACTGCTGG 500 

CGTGAAGAAA TTCaSgtSg tcSgSg TGGTACTAA * 550 

GCCACACACT AAATTCGAA? SgA^GtcS SSSSSS? ???^ CAA 600 

GTGGTCGTCA CACTCCATTr T-rnVV^^i^ 1ATTCTGAGC AAAGATGAAG 650 

CGTACAACTG SSSS f™ 0 *™ 1 ™ 

GGTAATGCCA GGTGATAACA TCAACATGAT* GCGTAGAGAT 750 

TCGCGATGGA CGACGGTTTA CGTTTCgSS CGTTACCCTG ATCCACCCAA 800 



50 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ^ GIN SH^S? : providencia rustigianii 

5 (B) STRAIN: ATCC 33673 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151 
CGGTGCAATC CTGGTTGTTG CTGCGACTGA TGGCCCAATG CCACAAACTC 
10 GTGAGCACAT CCTGTTAGGT CGCCAAGTAG GT^ T TAG AATTAGT 
TTC CTGAACA AATGTGACAT «3TTGACGW. g£ CCAGGCGACG 
TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTA ^ AGGTATCCC t 
ACACTCCAGT TGTTCGYGGT TCAGCACauj CTQG ACAGC TACAT 

GAGTGGGAAG CGAAAATTGT TGAATiA^u aTTCCTGCTG CCAATCGAAG 

15 CCCAGAACCA GAGCGCGCAA TTGACCGTCC Aiiu SCGTGTTGAG 
ACGTATTCTC AATCTCTGGT CGTGGTACAG G TATCCAAGA 
CGTGGTATCG TTAAAGTTGG TGAAGAAGTT Gj^TCGT TGCTTG 
CACRGTTAAA ACAACTTGTA CTGGCGTTGA AATGl X TGGTACTAAG 
ACGAAGGTCG TGCTGGTGAG AACGTTGGTG TTITA^ GTTCAATCAA 

20 CGTGAAGAAA TTCAACGTGG TCAAGTALiu AAAGATGAAG 
GCCACACACT ACTTTTGAAT CAGAACTTTA ^ GTTCT ACTTC 

GTGGTCGTCA TACTCCATTC TTCAAAGGCT JCOOTCC^ q^qTAGAGAT 

%5fi£5i SSSSSSS tcSStgat cgtgacactg attcacccaa 
25 tcSc^atgga tgatggttta cgtttc 



30 



2) INFORMATION FOR SEQ ID NO: 152 

(il'sEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 ( D ) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: .. 111f , Hi 

(A) ORGANISM: Providencia. stuartii 

( B ) STRAIN: ATCC 33672 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152 
45 CGGTGCAATC CTAGTTGTTG CGGCAACAGA TGGCCCAATG CCACAAACTC 
GTGAGCACAT CCTGTTAGGT CGTCAGGTTG ^^gg^ TGGAACTGGT 
TTC CTGAACA AATGTGACAT GGTAGACGAC CCA GGTGATG 
TGAAATGGAA GTTCGTGAAC TTCTGTCTCA AGGCAACCCA 
ACACTCCAGT ^ATCCGTGGT TCAGCGCTGA JAGCGTT^ ACAGCTACAT 
50 GAGTGGGAAG CGAAAATCGT TGAACTAGCA CCAATCGAAG 
CCCAGAGCCA GAGCGTGCAA rcGACAAGCC Aiiu CCGTGTTGAG 
ACGTATTCTC AATCTCAGGT CGTGGTACAG JAGTC GTATCAAAGA 
CGTGGTATCA TCAAAGTTGG TGAAGAAGli £££ TXC cGT AAACTGCTGG 
GACTGCGAAA ACCACTTGTA CTGGCGTTGA TGGTAC TAAG 

55 ACGAAGGCCG TGCGGGTGAG AACGTAGGTG TTCTGCTGCG 

CGTGAAGAAA TCGAACGTGG TCAAGTTCTG G^ AAAGATGAAG 
GCCACACACA JCTTTCGAAT CAGAAGTTTA TATTC ^ GTTCTACTTC 
GTGGTCGTCA CACGCCATTC ^CAAAGGXi CTGC CAGAAG GCGTAGAGAT 
60 SSSSSi ^SSSS SEEKS AGTAACTCTG ATTCACCCAA 

223 
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TCGCGATGGA CGATGGTTTG CGTTTCGCAA 

5 2 ) INFORMATION FOR SEQ ID NO: 153 

(i) SEQUENCE CHARACTERISTICS- 
(A) LENGTH: 827 bases 
nn i B l TYPE : Nucleic acid 

±U (O STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE- 

(A) ORGANISM: ' Pseudomelia s aeruginosa 

(B) STRAIN: ATCC 35554 

2Q (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153 

Sggg? CGCGAG ^ 50 

AAAGCCGACA TGGTCGACGA CGCCgISS ?^™ GT GTTGG TGAAC i 00 

AGTTCGCGAT CTGCTGAACA SSSSS? £°*»WX» 150 

TCATCATCGG TTCCGCGCTG ATGGCGCTGG Sr^n^ 0 ^CTCCGA 200 
ATCGGCGTAA GCGCCGTGCA GAAGCTGrS ^^S^ TGAG AACGGC 
TCCGGAGCCG GTTCGTGCCA ^SSHFH? ACTCCTACAT 



25 



30 



35 



40 



A1L.UGCGTAA GCGCCGTGCA GAAOPTrrTa r^^7Z^Z ^^^^C 250 

TCCGGAGCCG GTTCG?g2S SSSSSS ACTCCTACAT 3o5 

ACGTGTTCTC GATCTCCGGT CGCGGTAC^S CCGATCGAAG 350 

CGCGGCATCA TCAAGGTCCA GGAAGAAgSS ??°n^ AC So G TCGTGTAGAG 400 

GACCACCAAG ACCACCTGCA CCGGCrr??? ^^TCGTCG GCATCAAGGC 450 

ACGAAGGTCG TGCTGGTGAG S?™^ ^^ G ° ^CTGCTCG 500 

CGTGAAGACG TAGAGCGTGG CcSSSStc crrl^nn^ TGGCACCAAG 550 

GCCGCACACC AAGTTCGAGT gSSgTCTA SSSSSS ?SS^™ 600 

GTGGTCGTCA CACCCCGTTC TTrzZnnnXt CGTGCTGTCC AAGGAAGAAG 650 
CGTACCACKG aJS^S SSKSSSX* GTTCT ACTTC 

GGTAATGCCG GGCGACAACA ToSSSSS °J~S CGGAAG GCGTAGAGAT 750 

TCGCCATGGA AgSSSSS CTTCGCG TGTCAGGG TG ATCGCTCCGA 800 



2) INFORMATION FOR SEQ ID NO: 154 

(i) SEQUENCE CHARACTERISTICS • 
JA) LENGTH: 841 bases 
J B > TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



55 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE • 

(A) ORGANISM: Pseudomonas fluorescens 

(B) STRAIN: ATCC 13525 u °rescens 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 154 

SSSSSSS SSSS3SS S GATG cga caaaccc 

TACCTGAACA AGGCTGACCT GGtS^S o^ TCCGTA CATCGTGGTT ioo 
SO TGAGATGGAA GTGCGC^5? SSgSggg SS2g£ SgSSSS gj 

224 
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la • • 

ACACTCCGAT CAT^TCGGT TCTGCTCOTA TOGCTCTaOA AOGCAAAGAC 

GACAACGAAA TGGGCACCAC GTCCGTTCGT ^ TTCCTGATGC 

CAGCTACATC CCAGATCCAG TTCGTGTTAT TGTGACTGGT 

CAATCGAAGA CGTGTTCTCG ATCTCCGGTC ^GTA mV &rSQa 

5 CGTATCGAGC GCGGTATCGT TAAGGTTCAA ^ ATGTTCCGTA 

TCTGCGTGAC ACTACCGTCA CCACCTGCAC ^ TCTGCTGCGT 

AGCTGCTCGA CGAAGGTCGT GCTGGCGAGA AC1£ TTAAGCCAGG 
GGTACCAAGC GTGACGACGT JGAGCGTGGC CMGTTC^ GTACTGAGCA 

TTCGGTTAAG CCGCACACCA AGTTCGAAGC CCGTCCACAG 

10 AAGAAGAAGG CGGTCGTCAC ACTCCGTTCT TGCCGGAAGG 

TTCTACTTCC GTACTACTGA CGTGACTGGT AACT^ GTTACCCTGA 

SSEKSSS S3SSS3S SSSSJSS gtttcgctat ., 



15 2) INFORMATION FOR SEQ ID NO: 155 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 
20 ( C ) STRANDEDNESS : Double 



25 



30 



(D) TOPOLOGY : Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(Vi) ^ GIN oiGSSS E: pseudo m onas stutzeri 

(B) STRAIN: ATCC 17588 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155 
CGGCGCGATC CTGGTCTGCT CGGCTGCTGA CGGCCCCATG CCGCAGACTC 
GCGAGCACAT CCTGCTGTCC CGTCAGGTTG TCGAGCTGGT 
TTC CTGAACA AGGCCGACAT JGTTGATGAC ^ CCGGGTGAYG 
35 CGAGATGGAA GTTCGYGACC TGCTGTCGAC CTA CGGCGAAGAC 
ACACTCCGAT CATCATCGGC TCCGCGCTGA w AGACCCTGGA 
GACAACGAGC TCGGCACCAC TGCGGTGAAU JJ^ CQ ttccTGATGC 

CAGCTACATT CCCGAGCCGG JTCGTGCCAT CGACAA^ 
CGATCGAAGA CGTGTTCTCG Aiuic.uv3 GAAGAGATCG AGATCGTCGG 
40 CGCGTAGAGC GCGGCATCGT ^GGTTCAG GAAGAg ATGTTCC GCA 
TCTGCGTCCG ACCACCAAGA CTACCTGCAC CGGCGTT QCTGCTGCGT 
AGCTGCTCGA YGARGGTCGT GCTGGCGAGA CCAAGCCGGG 
GGCACCAAGC GTGACGAAGT GGAGCGTGGT CAGGTTC GTGCTGTCCA 
CACCATCAAG CCGCACACCA AGTTCGAAGC <_W£ CCGTCCKCAG 
45 AGGAAGAAGG TGGTCGTCAC ACCCCGTTCT TYAAGGGC1 TGCCGGAAGG 

IS? Eiil IS ssss s— 

50 

2) INFORMATION FOR SEQ ID NO: 156 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 
( c ) STRANDEDNES S : Doubl e 
(D) TOPOLOGY: Linear 

6 0 (ii) MOLECULE TYPE: Genomic DNA 

225 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Psychrobacter phenylpyruvicus 

(B) STRAIN : ATCC 23333 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156 

GCTATTCTAG TAGTATCAGC AACTGACGGT CCAATGCCAO aaararrrp, 

GCACATTCTA TTATCACGTC AGGTTGGTGT AoXSSSS A^SS™ n 5 ° 

10 ^? AAATG TGACATGGTA GATGACGAAG AGTTAcSgA SSaotSS J2X 

ATGGAAGTGC GTGAATTACT TTCAGACTAC GACTTCCCAG SSSSiSS l 5 ° 

TCCAATCATC AAAGGTTCAG CTTTAGAAGC GTTAAAtS? fZrnlnn^* 200 

AGTACGGTGA GCCAGCAGTT ATCGAACTAC TAAACACTCT j^SSSKK 250 

ATTCCAGAGC CAGAGCGTGA CATCGATAAG CCATtcSaI JSSS??™» 300 

is agacgtattc tcaatctcag gtcgtggtac actIg?Sc! JSSiSS 350 

AATCTGGTAT CATCAAAG TT GGTGACGAaJ TCGAA^TCGT S^SSI? 4 °° 

GACACAGTTA AAACAACTTG TACTGGTATC GAGATCTTrr nvTZnZZ^ 450 

"Sgg** 1 CGTGCTGGTG AGaSgTGG SSSSSS SSS^JS XX 

AGCGTGAAGA CGTACAACGT GGTCAAGTAC TTGrTia^o ao^S^ CTA 550 

20 ACTCCACACA CCAACTTCGA CGCAGAAGTA 3^i2S£ SKEEES? 6 °° 

AGGTGGTCGT CACACTCCAT TCTTAAATGC TT»^n£™ CAAAAGAAGA 650 

TCCGTACTAC TGACGTAACA GGTGCAATCA Sn^ TACT 700 

ATGGTAATGC CAGGCGATAA cSESSSSS AgStAGaIS iSSSSSXS ?5 ° 

AATCGCTAGG ACAAAGGTTT ACGTTTCGCA atc TAATCCACCC 800 

25 833 

2) INFORMATION FOR SEQ ID NO: 157 

30 (i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rahnella aquatilis 
40 (B) STRAIN: ATCC 33071 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 157 

45 SSSSc^^SSS^ 

TCATGAACAA ATGCGACATG GTAGATGArr zinln^^S. ATGATCGTGT 100 

GAAATGGAAG TTCGCGAACT TCTGtSSS ?JS?SS^ GGAAC TGGTA 150 

CATCCCGGTC ATCAAAGGTT CAGCgSgS SSSSJ*? GGGG ACGA 200 

CTTGGGAAGC GAAAATCATC GAAotSS? a^™^ 3 ^ GGCGATGCTA 25 0 

CCATTGCCAG *SSS?SSS SISSS? ^2SgSS SKSSE 1 300 

CGTATTCTCC ATCTCCGGTC GTGGTACAGT GGtoSSS ^S GAAGA 35 <> 

GCGGTATCGT TAAAGTGGGC GAAG»AGTTr SSSSSS^ CGTGTAGAG C 40 0 

ACTGTTAAGT CTACTTGTA? SStoSJSSS iJSESSS SSSSS* 450 

CGAAGGCCGT GCGGGCGAGA ACGTGGGTGT tSS^ AACTGCTGGA 500 

GTGAAGACAT CGAACGTGGT CAGOTTCTO? rZZ**n^~ T G GTATCAAGC 550 

CCACACACCA AGTTTGATTC CGAAGtcS S^ESS 0 TTCAATCAAA 600 

TGGTCGTCAC ACTCcStCT SSXSSSf SSSSXS AAGATGAAGG 650 

GTACAACTGA CGTGACCGGT aSKSEE ^ ACTTCC 700 

GTTATGCCTG GTGACAACGT GAACATGGT? SSSSJ^ GGTTGAGA TG 750 

60 CGCGATGGAT GACGGTCTGC GTTTC GT TACCCTGA TCCACCCAAT 800 



50 



55 



825 
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2) INFORMATION FOR SEQ ID NO: 158 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

( vi ) ORIGINAL SOURCE: , „ 

(A) ORGANISM: Salmonella choleraesuis subsp. anzonae 

(B) STRAIN: ATCC 13314 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158 

rOTGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 
StcSSSt cctXtgggc CGTCAGGTAG GCGTTCCGTA CATCATCGTG 
??2c?g£SI aatgcgacat GGTTGATGAC GAAGAGCTGC tggaactggt 
??JS?SgS gttcgtgaac ttctgtctca gtacgacttc ccgggcgacg 

CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 
SIS5gggaag CGAAAATCAT CGAACTGGCT GGCTTCCTGG ACTCTTACAT 
mgcuS^ gagcgtgcga TTGACAAGCC GTTCCTGCTG CCGATCGAAG 
ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 
SqSSStS TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 
SSSSJS TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 
acgJaggccg TGCCGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 
cotS^SSaa TCGAACGTGG TCAGGTACTG GCTAAGCCGG gcaccatcaa 
ScCQCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 
GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 
2g?aSaSS ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTGGAGAT 750 
GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 
TCGCGATGGA CGACGGTCTG CGTTTCGCAA 
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2) INFORMATION FOR SEQ ID NO: 159 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 

serotype choleraesuis 

(B) STRAIN: ATCC 7001 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGYCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 2 00 
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ACACGCCGAT CGTTCGTGGT 
GAGTGGGAAG CGAAAATCAT 
TCCGGAACCA GAGCGTGCGA 
ACGTATTCTC CATCTCCGGT 
CGCGGTATCA TCAAAGTGGG 
GACTCAGAAG TCTACCTGTA 
ACGAAGGCCG TGCCGGTGAG 
CGTGAAGAAA TCGAACGTGG 
GCCGCACACC AAGTTCGAAT 
GCGGC CGTCA TACTCCGTTC 
CGTACTACTG ACGTGACTGG 
GGTAATGCCG GGCGACAACA 
TCGCAATGGA CGACGGTCTG 



TCTGCTCTGA AAGCGCTGGA 
CGAACTGGCT GGCTTCCTGG 
TTGACAAGCC GTTCCTGCTG 
CGTGGTACCG TTGTTACCGG 
YGAAGAAGTT GAAATCGTTG 
CTGGCGTTGA AATGTTCCGC 
AACGTAGGTG TTCTGCTGCG 
TCAGGTACTG GCTAAGCCGG 
CTGAAGTGTA CATTCTGTCC 
TTCAAAGGCT ACCGTCCGCA 
CACCATCGAA CTGCCGGAAG 
TCAAAATGGT TGTTACCCTG 
CGTTTCGCAA TC 



AGGTGACGCA 


250 


ATTCTTACAT 


300 


CCGATCGAAG 


350 


TCGTGTAGAG 


400 

~ u u 


GTATCAAAGA 


T J v 


AAACTGTTGG 


500 


TGGTATCAAA 


550 


GCACCATCAA 


600 


AAAGATGAAG 


650 


GTTCTACTTC 


700 


GCGTAGAGAT 


750 


ATCCACCCGA 


800 




832 



2) INFORMATION FOR SEQ ID NO: 160 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraeauis subsp. diarizonae 

(B) STRAIN: ATCC 43973 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160 

CCTGGTTGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACC CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATTATCGT GTTCCTGAAC 100 

AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TAGAAATGGA 150 

AGTTCGTGAA CTTCTGTCTC AGTACGACTT CCCGGGCGAC GACACGCCAA 2 00 

TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGACGC AGAGTGGGAA 250 

GCGAAAATCA TCGAACTGGC TGGCTTCCTG GATTCTTACA TCCCGGAACC 3 00 

AGAGCGTGCG ATTGACAAGC CGTTCCTGCT GCCGATCGAA GACGTATTCT 350 

CCATCTCCGG TCGTGGTACC GTTGTTACCG GTCGTGTAGA GCGCGGTATC 4 00 

ATCAAAGTGG GCGAAGAAGT TGAAATCGTT GGTATCAAAG AGACTCAGAA 4 50 

GTCTACCTGT ACTGGCGTTG AAATGTTCCG CAARCTGCTG GACGAAGGCC 500 

GTGCWGGTGA GAACGTAGGT GTTCTGCTGC GTGGTATCAA ACGTGAAGAA 55 0 

ATCGAACGTG GTCAGGTACT GGCTAAGCCG GGCACCATCA AGCCGCACAC 6 00 

CAAGTTC GAA TCTGAAGTGT ATATTCTGTC CAAAGATGAA GGCGGCCGTC 650 

ATACTCCGTT CTTCAAAGGC TACCGTCCGC AGTTCTACTT CCGTACCACT 700 

GACGTGACTG GCACCATCGA ACTGCCGGAA GGCGTGGAGA TGGTAATGCC 750 

GGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCG ATCGCGATGG 800 

ACGACGG Q07 



2) INFORMATION FOR SEQ ID NO: 161 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(viJORIGINAL^URCE:^^^^ cholerae8uie subsp . cho ler ae suis 

serotype heidelberg 
(B) STRAIN: ATCC 8326 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161 

„„„ pnirrp PTCPTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 
CG S CG £?S t??gSgSgt CGTCAGGTAG GCGTTCCGTA CATCATCGTG 
GTG ~S™??I S?g?5acat GGTTGATGAC GAAGAGCTGC TGGAACTGGT 
TTCC ^St ???cg?gSc TTCTOTCTCA GTACGACTTC CCGGGCGACG 
TGAAATGGAA GTCCCTGAAC ^^1^1 AAGCGCTGGA AGGCGACGCA 
ACACTCCGAT CGTTCGTGGT TCTGCTCTGA AA6C ^ ATTCTTACAT 
GAGTGGGAAG CGAAAATCAT CGAACTG^ QTTCCTGCTG CCGATCGAAG 
CCCGGAACCA GAGCCTGCGA ™ALAA^ ^q^ccqg XCGTGTAGAA 

gSS SEEKS SSSSSSS gIaatcgttg gtatcaaaga 

CGCGGTA1L.A ^^2?^^ rTnnPGTTGA AATGTTCCGC AAACTGCTGG 

S££££ 2SSSSS5 tctgctgcg tggtatcaaa 

acgaaggccg tcaggtactg gctaagccgg gcaccatcaa 

IS S ISss ESS 3S3ES 

liill IS ssss ssss ssss 

TCGCGATGGA CGACGGTCTG CGTTTCGCAA TC 



50 
100 
150 
200 
250 
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2) INFORMATION FOR SEQ ID NO: 162 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 807 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

^^SSnSSS?' Salmonella ctoleraesuis subsp. 

(B) STRAIN: ATCC 43974 



houtenae 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162 



CCTGGTTGTT 
TCCTGCTGGG 
AAATGCGACA 
AGTTCGTGAA 
TCGTGCGTGG 
GCGAAAATCA 
AGAGCGTGCG 
CTATCTCCGG 
ATCAAAGTGG 
GTCGACCTGT 
GTGCTGGCGA 
ATCGAACGTG 
CAAGTTCGAA 
ATACTCCGTT 
GACGTGACTG 



GCTGCGACTG 
TCGTCAGGTA 
TGGTTGATGA 
CTTCTGTCTC 
TTCTGCTCTG 
TCGAACTGGC 
ATTGACAAAC 
TCGTGGTACC 
GCGAAGAAGT 
ACTGGCGTTG 
GAACGTAGGT 
GTCAGGTACT 
TCTGAAGTGT 
CTTCAAAGGC 
GCACCATCGA 



ACGGCCCGAT 

GGCGTTCCGT 

CGAAGAGCTG 

AGTACGATTT 

AAAGCGCTGG 

TGGCTACCTG 

CGTTCCTGCT 

GTTGTTACCG 

TGAAATCGTT 

AAATGTTCCG 

GTTCTGCTGC 

GGCTAAGCCG 

ACATTCTGTC 

TACCGTCCGC 

ACTGCCGGAA 



GCCGCAGACC 

ACATCATCGT 

CTGGAACTGG 

CCCGGGCGAC 

AAGGCGACGC 

GATTCTTACA 

GCCGATCGAA 

GTCGTGTAGA 

GGTATCAAAG 

CAAACTGCTG 

GTGGTATCAA 

GGCACCATCA 

CAAAGATGAA 

AATTCTACTT 

GGCGTGGAGA 



CGTGAGCACA 50 

GTTCCTGAAC 100 

TAGAJVATGGA 150 

GACACGCCGA 200 

AGAATGGGAA 2 50 

TCCCGGAACC 300 

GACGTATTCT 3 50 

GCGCGGTATC 400 

AGACTCAGAA 450 

GACGAAGGCC 500 

ACGTGAAGAA 550 

AGCCGCACAC 600 

GGCGGCCGTC 650 

CCGTACGACT 700 

TGGTAATGCC 75 0 
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GGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCG ATCGCGATGG 800 
ACGACGG 80 7 



2) INFORMATION FOR SEQ ID NO: 163 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 827 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. indica 

(B) STRAIN: ATCC 43976 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163 

CGCGATCCTG GTTGTTGCTG CGACTGACGG CCCGATGCCG CAGACCCGTG 5 0 

AGCACATCCT GCTGGGTCGT CAGGTAGGCG TTC CGTACAT CATCGTGTTC 10 0 

CTGAACAAAT GCGACATGGT TGATGACGAA GAGCTG CTGG AACTGGTTGA 15 0 

2 5 AATGGAAGTT CGTGAACTTC TGTCTCAGTA CGACTTCCCG GGTGACGACA 20 0 

CGCCGATCGT GCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGACGCAGAG 250 

TGGGAAGCGA AAATCATCGA ACTGGCTGGC TTCCTGGATT CTTACATTCC 3 00 

GGAACCAGAG CGTGCGATTG ACAAGCCGTT CCTGCTGCCG ATCGAAGACG 35 0 

TATTCTCCAT CTCCGGTCGT GGTACYGTTG TTACCGGTCG TGTAGAGCGC 400 

3 0 GGTATCATCA AAGTGGGCGA AGAAGTTGAA ATCGTTGGTA TCAAAGAGAC 45 0 

TCAGAAGTCT ACCTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGACG 500 

AAGGCCGTGC CGGTGAGAAC GTAGGTGTTC TGC TGCGTGG TATCAAACGT 55 0 

GAAGAAATCG AACGTGGTCA GGTACTGGCT AAGCCGGGCA CCATCAAGCC 600 

GCACACCAAG TTCGAATCTG AAGTGTACAT TCTGTC CAAA GATGAAGGCG 650 

3 5 GCCGTCATAC TCCGTTCTTC AAAGGCTACC GTCCGCAGTT CTACTTCCGT 700 

ACTACTGACG TGAC TGGCAC CATCGAACTG CCGGAAGGCG TAGAGATGGT 750 

AATGCCGGGC GACAACATCA AAATGGTTGT TACCCTGATC CATCCGATCG 8 00 

CRATGGACGA CGGTCTGCGT TTCGCAA 82 7 



40 



55 



2) INFORMATION FOR SEQ ID NO: 164 



(i) SEQUENCE CHARACTERISTICS : 
45 (A) LENGTH: 807 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. salamae 

(B) STRAIN: ATCC 43972 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164 

CCTGGTTGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACC CGTGAGCACA 50 
TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 100 
60 AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TAGAAATGGA 150 

230 



CA 02283458 1999-09-28 

• # » * 



AGTTCGTGAA CTTCTGTCTC jgjgggg JSSSK TGAGTGGGAA 
TCGTGCGTGG TTCCGCTCTG ^OGCTGG MgG^^ TTCCGGAACC 
GMGAAAATCA TCGAACTGGC JGGCTWCCTe GAA GACGTATTCT 

AGAGCGTGCG ATTGACAAGC CGTTCCTGt-i AQA GCG CGGTATC 

CCA?C?CCGG TCGTGGTACC ^TGTTACCG OgGTQ^ AGACTCAGAA 
AT C AAAGTGG GCGAAGAAGT TGAAATC^l i TQCTG GACGAAGGCC 

GTCTACCTGT ACTGGCGTTG AAATGTTCCG ACGTGAAGAA 
GTGCCGGTGA GAACGTAGGT GTTCTGCTOC ^ AGC CGCACAC 

ATCGAACGTG GTCAGGTACT GGCTAAGCC-U GftA QGCGGCCGTC 

££££ SSSggc |XT S TT CCgggg 

SSSSSS sees* SSSS» 



200 

250 

300 

350 

400 

450 

500 

550 

600 

650 

700 

750 



ACGACGG 



2) INFORMATION FOR SEQ ID NO: 165 

, (i) SEQUENCE CHARACTERISTICS: 

} (A) LENGTH: 832 bases 

TYPE: Nucleic acid 
C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIG^ choIeraesuis subsp. choleraesuis 

0 serotype typhi 

( B ) STRAIN: ATCC 10749 

( Xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 165 

35 CGGCGCGATC ^^p]E GGG !^ CGTCAGGTAG GCGTTCCGTA CATCATCGTG 
GTGAGCACAT CCTGCTGGGT £GTCAGOTA£ ^ QC XGGA ACTGGT 

TTCCTGAACA AATGCGACAT ^TTGATGAC bftft CCGGGCGACG 
TGAAATGGAA GTTCGTGAAO JTCTCTCTCA ^CGCTGGA AGGCGACGCA 
ACACGCCGAT CGTTCGTGGT TCTGCTCTGA Ag*-^ ATTCTTACAT 

40 GAGTGGGAAG CGAAAATCAT CGAACTGGCT ^g GCTG cCGATCGAAG 
CCCGGAACCA GAGCGTGCGA TTGACAAGCC 6 TCGTGTAGAA 
ACGTATTCTC CATCTCCGGT CGTGGTAC^ ii QTATCAAAGA 

cgcggtatca tcaaagtggg cgaagaagtt c aaactgctgg 

GACTCAGAAG TCTACCTGTA CTGCJXWTOJ ^TGTl TGQTATCAAA 
45 ACGAAGGCCG TGCNGGTGAG AACGTAGGTC 1 G GCACC ATCAA 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG QCl aAAGATGAAG 

ss sss ske* Iggga sssss 

50 gSSSS SSSSS SSSSSS f TACCCTO ATCCACCCGA 
50 ?25^I?gga CGACGGTCTG CGTTTCGCAA tc 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
832 



55 2 ) INFORMATION FOR SEQ ID NO: 166 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

231 



(DJ TOPOL0 ^: Linear 
( 1 i ) MOLECULE TYPE- r 
, Genomic DNA 




35 



10 



f i i ) MOLECULE TYPE- n 

/, : Ge n°mic dna 
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SS5S = SSSES SSS~ A ~ c - 



5 2 ) INFORMATION FOR SEQ ID NO: 168 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 745 bases 

(B) TYPE: Nucleic acid 
1U ( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



20 



(ii) MOLECULE TYPE: Genomic DNA 

fvi 1 ORIGINAL SOURCE : 

(A) ORGANISM: Serratia ma^rcescens 

(B) STRAIN: ATCC 13880 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168 

GCCTCAGACT CGTGAGCACA TCCTGCTGGG TCGTCAGGTT GGCGTTCCTT 
TCATCATCGT ATTCATGAAC AAATGCGACA J^TGATGA Wg^GlkCTT 
YTGGAACTGG TAGAAATGGA AGTTCGCGAA ^TGC^ aaagCGCTGG 

25 ccctggcgac ^ctgccgg taatccgcgg ttccgcgc^ c cgaagccctg 

AAGGCGAAGC TGAGTGGGAA JCGAAAATCA CGTTCCTGCT 
GACAGCTACA JCCCAGAGCC AGAGCGTGCT ATCGAC^ GTTGTTACCG 
GCCAATCGAA GACGTATTCT JCATCT^ gCGAAGAAGT TGAAATCGTT 
GTCGTGTTGA GCGCGGCATC ^CAAA^ AC TGGCGTTG AAATGTTCCG 
30 GGTATCAAAG ACACCGTTAA GTCTACCTGT ACTO^ GTTCTGCTGC 
CAAACTGCTG GACGAAGGCC GTGCTGGTGA qqctaagcCA 
GTGGTATCAA ACGTGAAGAA ATCGAACGTG GJCAGGTAC aCATCCTGAG 
GGCTCCATCA AGCCGCACAC JCAGTTCGAA TAC CGTCCAC 
CAAAGATGAA ^GTGGTCGTC ACACKCCATT CTTCAAA^ ^ 

35 SSgtSaga tSgtaaScc Sgcgacaac gtgaacatgg ttgta 



50 
100 
150 
200 
250 
300 
350 
400 
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40 



2) INFORMATION FOR SEQ ID NO: 169 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

e (O STRANDEDNESS: Double 

( D ) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

en (vi) ORIGINAL SOURCE: . 

50 ( A ) ORGANISM: Serratia odonf era 

(B) STRAIN: ATCC 33077 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169 
55 GGCGCAATCC TGGTTGTTGC = 



50 
100 
150 
200 
250 
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^GGGAAGC TAAGATTGTA GAACTGGCTG AAGCGCTGGA TTCTTACATC 
CCAGAACCAG AGCGTGCTAT CGACAAGCCG TTCCTGCTGC CAATCGAAg£ 
CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGtoS™^ 

SK5KSi T CAAAGTTGGC gaagaagttg aaatcgt?gg ?a??aa?ga? 

^^^^ GT CTACC TGTAC CGGTGTAGAA ATGTTCCGCA AACTGCTOe? 
™ GGCCGT GCTG <3TGAGA ACGTTGGTGT TCTGCTGCGT GGTAtSSS 
G T?AAGACAT CGAACGTGGT CAGGTTCTGG CTAAACCAGG SSaTCaIE 
S CG ^ CACCA AATTCGACTC AGAAGTGTAC ATCCTGAGCA AAgSgSS? 
TGGTCGTCAC ACGCCATTCT TCAAAGGCTA CCGTCCACAG TTCTarTTnp 
GTACTACTGA CGTGACCGGT ACCATCGAAC TGCCAGaS^ SSSSSS 
G TAATGCCAG GCGATAACGT GAACATGGTT SSSJSK? 
CGCAATGGAC GACGGTCTGC GTTTCGCAA ii<-ACCCAAT 



15 



2 ) INFORMATION FOR SEQ ID NO: 170 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 83 0 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia plymuthica 

(B) STRAIN: DSM 4540 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170 

CGGCGCAATC CTGGTTGTTG CTGCGACTGA CGGCCCAATG CCTC^rrn 

G E? AGCA CAT CCTGCTGGGT CGTCAGGTTG gSttCCTT? StSS^S? ^ 

TTCATGAACA AATGCGACAT GGTTGATGAT GAAGAGCTGr T?r^nS™ 10C 

35 AGAAATGGAA GTTCGTGAAC TTCTGTCTGC TbKSScTO 15C 

ACCTGCCGGT TGTTCGTGGT TCAGCGCTGA aSSSSS? 200 

GAGTGGGAAG CTAAAATCAT CGAGCtSS A??™§S ?on 

CGC ^ CCA gagcgtgcta tcgacaagcc gttcctgctg cxJSSSE lin 

^ ^ GG J A ^ TCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTQTTO?? 

40 CGCGGTATCG TTAAAGTTGG CGAAGAAGTT GAAATCGTGG StaJSIJS? 4 °° 

CACCGTTAAG TCTACCTGTA CCGGCGTTGA AATGTTcSS 450 

ACGAAGGCCG TGCTGGTGAG AACGtcStG JkSSSSSS wESSSSS f°° 

CGCGAAGATA TCGAACGTGG TCAGGTCCTG GCTAAAPPAr* ^SS™!^ 0 550 

« sssss assss ggssl EIE ilF 

SE3S SSS =11 HE S K 

TCGCGATGGA CGACGGCPTa rr^nonA, ^^LLib ATCCACCCAA 800 



50 

2 ) INFORMATION FOR SEQ ID NO : 171 

(i ) SEQUENCE CHARACTERISTICS • 
55 (A) LENGTH: 82 9 bases 

(B) TYPE; Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 



830 
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(vi) ORIGINAL SOURCE: „ t Md*~m 

(A) ORGANISM: Serratia rubidaea 

(B) STRAIN: ATCC 27593 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171 

GGCGCAATCC TGGTAGTAGC A^ACTGAC OGCCCAATGC CTCAGACCCG 
TGAGCACATC CTGCTGGGCC GCCAGGTAGG CQTWL^i^ qgaacTGGTA 

io tcatgaacaa atgcgacatg otagatgatg aagagctg cc cMQOaAcaJl 

GAGATGGAAG TTCGCGAACT ^TGTCTGCT TA ^ qqcqaAGCCG 
CCTGCCGGTA ATCCGTGGTT CCGCGC1UAA A caGCTACATC 

AGTGGGAAGC GAAAATCGTT GAJCTGGCAG AAGCG^i c^tcgaaGA 

ccagagccag agcgtgctgt agacaagccg ttcctgc gqt cgtgtagagc 

15 CGTATTCTCC ATCTCCGGTC GTGGTACL^i t^^j^Q TATCAAAGAC 
GCGGTATCAT CAAAGTTGGT GAAGAAGTAG JAATCGTAg 
ACCGTTAAGT CTACCTGTAC TGGCGTAGAA ATGTiU qqtatCAAGC 
CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT JCTGCT^ tcaaq 
GTGAAGAAAT CGAACGTGGT JAGGTACTGG gAAGCUft AAGACGAAGG 

20 CCGCACACCC AGTTCGAATC TGAAGTGTAC ATTCl^i ttCTACTTCC 
CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACA^ CGTAGAGATG 
SSSSSS GCGACAACGT GTTACTCTGA TTCACCCAAT 

CGctI?GGAC GACGGTCTGC GTTTCGCAA 



25 



30 



35 



2) INFORMATION FOR SEQ ID NO: 172 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
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600 
650 
700 
750 
800 
829 



(ii) MOLECULE TYPE: Genomic DNA 

fvil ORIGINAL SOURCE: 

(A) ORGANISM: Shigella boydn 

40 ( B ) STRAIN: ATCC 9207 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172 

CGGCGCGATC YTGGTAGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC JO 

45 GTGAGCACAT CCTGCTGGGT CGTCAGGTAG gQTTCOBE* CAT^^^ 150 

TTCCTGAACA AATGCGACAT GGTTGATGAC CCGGGCGACG 200 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA ^CGACTTC ^ CA 250 

ACACTCCGAT CGTTCGTGGT JCTGCTCTGA AAGCGCT^A ^ 2QQ 

GAGTGGGAAG CGAAAATCCT GGAACTGGCT ^TTCCTG cCGATCGAAG 350 

50 TCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTU TCGTGTAGAA 400 

ACGTATTCTC CATCTCCGGT JGTGGTACCG TTGTaA^ GTATCAAAGA 450 

CGCGGTATCA TCAAAGTTGG TGAAGAAGTT ^ATCO aaacTGCTGG 500 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGlluu xGGTATCAAA 550 

acgaaggc cg tgctggtgag aacgtaggtg g TCTGctgc q gcaccatcaa 600 

55 CGTGAAGAAA TCGAACGTGG TCAGGTACTG AAAGATGAAG 650 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA ^^1^1 QTTCTACTTC 70 0 

60 TCGCGATGGA CGACGGTCTG CGTTTC 
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5 



2) INFORMATION FOR SEQ ID NO: 173 



(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A > ORGANISM: Shigella dysenteriae 

(B) STRAIN: ATCC 11835 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173 

20 TGGTAGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTrr T^rn^^o 

CTGCTGGGTC GTCAGGTAGG CGTTCOsSc mSSoSSS JSS?££? 5 ° 

gssss ss ssssss sS sEsi B 
sgssggg sssss ss sEE 1E^ ? »" 

caaagttggt gaagaagttg aaatcgttg? SKXSSJS gcggtatcat 400 

30 Kg™ ™| 1^ ^SSS is 

SS3E£ sssss? ss 1EE «" 

ACTCCGTTCT TCAAAGGCTO TCStcS wSSg? SSSSJSI 650 

3s SK5SE S££!K sllll EF^ ™ 

GACGGTCTGC GTTTCGCA ^ A1A ^ CTGA TCCACCCGAT CGCGATGGAC 8 00 

818 

40 2) INFORMATION FOR SEQ ID NO: 174 

( i ) SEQUENCE CHARACTERISTICS • 
(A) LENGTH: 806 bases 
< B > TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella flexneri 

(B) STRAIN: ATCC 12022 

55 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174 

SSS ?SS iggggg ™ CGTGAGCACA 

AAATGCGACA 1*3GTTGATGA GTTCCTGAAC loo 

AGTTCGTGAA CTTCTGTCTC AGTaSS? SSSJ™' 3 TTaA ^aaK 150 

TCGTTGGTGG TTCTGCTCTG iSSSSS 2 °° 



250 
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OCGAAAATCC TGGJJgOOC TgCJgCg Ogggg ggggg 

SEEK ESSSK g||cc ? gg™ jg-ggSS 

ATCAAAGTTG GTGAAGAAGT JG^ATCGXT eSTATCA^ GACGAAGGC C 
5 GTCTACCTGT ACTGGCGTTG JAATGTTCCG CAAA ACGTGAAGA a 
GTGCTGGTGA GAACGTAGGT GTTCTUC-l^ -q^^ca AG CCGCACAC 
ATCGAACGTG GTCAGGTACT GGCTAAGCCG g^gg^ qqcggmCGTC 
CAAGTTCGAA TCTGAAGTGT ^ATTCTGTC CMAfiM^TT CCGT ACTACT 
ATACTCCGTT CTTCAAAGGC TACCGTCC^C A TGGTAATGCC 

10 KCAAC ESSKi XScCACCCG ATCGCGATGG 800 



ACGACG 



15 2 ) INFORMATION FOR SEQ ID NO: 175 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 
20 (C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi 1 ORIGINAL SOURCE: 

(A) ORGANISM: Shigella soimei 

(B) STRAIN: ATCC 29930 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175 

CGGCGCGATC CTOGT^TTG ^ egCCCgg SgSSSS 

SSSSSK SS gmSgc aggjcgc jgggg 

TGAAATGGAA ^TCGTGA^C TTCTGTCTCA ^CGACT^ 
ACACTCCGAT CGTTCGTGGT TCTGCT^iUA aa AT TCTTACAT 
GAGTGGGAAG CGAAAATCCT ^^TGGCT GGCnLL CCGATC GAAG 
TCCGGAACCA GAGCGTGCGA TTGACAAGCC gTCCTGCTG 
ACGTATTCTC CATCTCCGGT JGTGGTACCG Tlji GTATCAAAGA 
CGCGGTATCA TCAAAGTTGG JGAAGAAGTT ™gu aaactGCTGG 
GACTCAGAAG TCTACCTGTA ^TGGCGTTGA XGGTATC AAA 

acgaaggccg tgctggtgag ^gtaggtg ttctgctg gcaccatcaa 

CGTGAAGAAA TCGAACGTGG JCAGGTACTG ^AAG 

45 sssss ssss| sgsg, sssss sees 

SS SS SSSSSS ATCCCCCO, 
??G?S?GGA CGACGGTCTG CGTTTCGCAA TC 



25 



30 



35 



40 
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2) INFORMATION FOR SEQ ID NO: 176 



(i) SEQUENCE CHARACTERISTICS: 
„ (A) LENGTH: 716 bases 

( B ) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 

237 
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5 



10 



15 



20 



25 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 13301 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176 

ESSS SggEgs SSK2 ESS?" 50 

TGGTTGACGA TGAAGAATTA TTAGAATTAC tSSIT**^ AAAGTTGACA 100 

ssss ssss S S Sees - 

TAGAATTAAT GGAAGCTGTA GATACTTACA ™S GAAAAAATCT 250 

TCTGACAAAC CATTCATGAT GCCActSS? ™2^2*£ AGAACG TGAT 300 

TCGTGGTACT GTTGCTaSS SS35SSX SSSSES 350 

GTGAAGAAGT TGAAATCATC rrTTTa™™ ^,2 GGTCAA ATCAAAGTTG 400 

ACAGGTGTTG 2S2SSS 2S5K£E£ AACAACTGTT 45S 

ssss sggg ™1 I IE gasss a; 

sssss jssse ssgss «« 

GTGTTGTTCA CTTACC AATTCTATTT CCGTACTACT GACGTAACTG 70 0 



2) INFORMATION FOR SEQ ID NO: 177 

(i) SEQUENCE CHARACTERISTICS • 
(A) LENGTH: 719 bases 
30 (BJ TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



40 



45 



50 



55 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 2 9247 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177 

SSS3S JS2SSS2 S3KE a a ttcttaaac - 

AGTTCGTGAC TTATTAAGCG AAT^^S SSSSESS ^™ TGGA "0 

TAATCGCTGG TTCAGCATTA AAAGCTTTAr SSSSJSSH GATGTACCT G 150 

GAAAAAATCT TAGAATTAAT ?SSS» AAGGCGATGC TCAATACGAA 200 

AGAACGTGAT TCTGACAAAC TTCCAACTGG 2sS 

CAATCACTGG TCGTGGTACT GTTgSISJ JSSSF 3 * 0 <»COTATTCT 300 

ATCAAAGTTG GTGAAGAAGT S2S5SSS SSJS ACGTGGTCAA 350 

AACAACTGTT ACAGGTGTTG AAATCT^S SS2^? ACACATCTAA 400 

AAGCTGGTGA CAACATTGGT GCATTaS? l^lr^nn GACTACGCTG 450 

GTACAACGTG GTCAAGTATT AGCTGrrrrT ^™2 T ° TTGC TCG TGAAGAC 50 0 

TGAATTCAAA GCAGAAGTAT ACgSSSS SSS*^* CACCACAT AC 550 

ACACTCCATT CTTCTCAAAC TATCCtS^ AAAA ^ ACGAA GG TGGACGTC S0 0 

GACGTAACTG GTGTTGTTCA otSSJ? ^Z^^ CCG TACTACT 6 so 

TGGTGATAAC GTTGAAATG CTTACCAGAA GGTMCTGAAA TGGTAATGCC 700 



50 



60 2) INFORMATION FOR SEQ ID NO: 178 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(B) STRAIN: ATCC 33591 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178 

15 GTGAACACAT ^T|EE^jj^j^ GGTT^ACGAT GJ^GAATTAT TAGAATTAGT 
TTCTTAAACA AAGTTGACAT ^TGACGAT GAA CCAGGTGACG 
AGAAATGGAA GTTCGTGACT JATTAAGCUA ^ AGGCGATGCT 
ATGTACCTGT AATCGCTGGT TCAGCATTAA JJJCTT^ atacttacat 

20 CAATACGAAG AAAAAATCTT AGAAliAAJ^ ATTCATGATG CCAGTTGAGG 
TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG ^ 
ACGTATTCTC AATCACTGGT £GTGGTACTG ™™ GTTTACATGA 
C GTGGT CAAA TCAAAGTTGG JGAAGAAGTr TTCCQT aaaTTATTAG 

CACATCTAAA ACAACTGTTA lj^^^^ n&TTATTACG TGGTGTTGCT 

25 ACTACGCTGA AGCTGGTGAC AACATTGGTG ^TATTACG 

3SES5S SSraSS SSaagtSa CGTATTATCA AAAGACGAAG 
GTGGACGTCA CACTCCATTC TTCTC 



2) INFORMATION FOR SEQ ID NO: 179 

( i ) SEQUENCE CHARACTERISTICS : 
-.c- ( A ) LENGTH: 704 bases 

( B ) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(\ri ) ORIGINAL SOURCE : 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 43300 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179 

sssss « s Si US 

50 GCGAATATGA CTTCCCAGGT ^GATCTAC TCTTAGAATT 
TTAAAAGCTT TAGAAGGCGA TGCTCAA1AL. ^^AACGT GATTCTGACA 
AATGGAAGCT GTAGATACTT ACATTCCAAC TCCAGAACGT 

s HE |K T ™ ? ssssss 

SSSSci CCTGGTTCAR TTACACCACA ™CT???™? iJJSSSS 

.. SSSSS SSiSSS SSKKS 2SSSS SSSSS 
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TCACTTACCA GAAGGTACTG AAATGGTAAT GCCTGGTGAT AACGTTGAAA 700 

704 

5 

2) INFORMATION FOR SEQ ID NO: 180 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 73 0 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY; Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus subsp. aureus 

(B) STRAIN: ATCC 6538 aureus 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180 

SSiSSiE ESSES S5 SSSSS £25523 50 

25 sssssi sss sssss EEi Si? II s 
£S| ssjsss sssss s™ Si 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG COsKS??? 12° 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCAtS 350 

3 0 CACATCTAAA ACAACTGTTA CAGGTGTTGA AATGTTCCGT 400 

ACTACGCTGA AGCTGGTGAC AACmSSSS SS J S ° 

CGTGAAGACG TACAACGTGG TCAAGTATTA rrTrPTP^n ™™ TTGCT 500 

ACCACATACT GAATTCAAAG SSXSJS S^SSS 550 

GTGGACGTCA CACTCCATTC TTCTCAAACT ATrrrnSiSf AAAGACGAAG 600 

35 CGTACTACTG ACGTAACTGG iSSSJJSS 650 

GGTAATGCCT GGTGATAACG TTGAAATGAC? TTACCAGAAG GTACTGAAAT 700 

730 

40 2) INFORMATION FOR SEQ ID NO: 181 

(i) SEQUENCE CHARACTERISTICS- 
(A) LENGTH: 834 bases 
<B) TYPE: Nucleic acid 
45 (O STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus auricularis 

(B) STRAIN: ATCC 33753 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181 



55 



60 



SSSSSS ZSggESi S?SS£S SEKjES ?***»"*c 
SS5S3S SSSSS S IIP- SSSSS iSS 

ATGTACCTGT AATCTCTGOT ffiSSK 2 °° 
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GAATACGAAC AAAAAATCTT AGACTT^TG CAACAAGTTG ACGATTACAT 
TCCAACTCCA GAACGTGACT £TGATAAACC ATI CCGTGTTGAA 
ACGTATTCTC AATCACTGGT CGTGGTACTG TT^ GTATGAAAGA 
CGTGGTCAAA TCAAAGTCGG TGAAGAAU1 i c CGTAAATTA T 

5 CGGTTCACAA AA^CAACAG JJACTGGTGT AGAAATGT^ 

TAGACTACGC TGAAGCTGGT GACAACAi^ TTAGCTGCTC CTGGTTCAAT 
TCACGTGAAG AAGTACAACG TGGTCAAGT1 TCTAAAGATG 
TACACCACAC ACTAAATTCA CTGCAGAAGT JTACGTATTA TTCTAT 
AAGGTGGACG TCACACTCCA TTCTTCTCTA ACTA<£ j^ GGTACAGK 
10 ES iSSESS |LgTTGAA TTAATTTCTC BOO 

CAATCGCTAT CGAAGACGGT ACTCGTTTCT CAAT 
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2) INFORMATION FOR SEQ ID NO: 182 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 835 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



20 



25 



50 



(ii) MOLECULE TYPE: Genomic DNA 

(Vi> <£r N £sSr staphylococcus capitis subsp. capitis 

(B) STRAIN: ATCC 27840 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182 

SSSS SSSSK ssssg « S 

T SIEJ!S ottcgtoS? ?aSaag?ga SKtgacttc ccaggtgatg 

TGAAATGGAA GT TCGTGALi AAGCTTTAGA AGGCGATGCT 

atgtacctgt aatcgctggt tcagcattaa aagctt gttg atqattacat 

CAATACGAAG AAAAAATCTT ATTCATGATG CCAGTTGAGG 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG gCA^^ 
ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCT GTATCCACGA 
CGTGGTCAAA JCAAAGTTGG gAAGAAGTT GAAA ^ 

AACTTCTAAA ACAACTGTTA ^^^^ rTTTATTACG TGGTGTTGCT 
ACTACGCTGA AGCTGGTGAC AACATCGGTG RTTTATTACG 
CGTGAAGACG TACAACGTGG TCAAGTAilA £ AAAGACGAAG 
ACCACACACT AAATTCAAAG CGGAAGTTTA OOTTT1 ATTCTATTTC 
GTGGACGTCA CACTCCATTC JTCAGTAACa ^ GTACTGAAAT 

SSSSS? SSSSE agttgaatta atcgctccta 

?C™A??GA AGACGGTACT CGTTTCTCAA TCGGA 
2 ) INFORMATION FOR SEQ ID NO: 183 



( i ) SEQUENCE CHARACTERI STI CS : 
55 (a) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Staphylococcus caseolyticus 

(B) STRAIN: ATCC 13548 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183 

GTATCTTAGT AGTATCTGCT GCTGACGGTC CAATGC CACA AACTCGTC A A 
CACATCCTTT TATCACGTAA CGTTGGTGTA CCAGCATTAG SSaTTOT 
^^ AAGTT GACATGGTTG ACGATGAAGA ATTATTAGAA TTAGtJSaI 
n^^ TTCG TGACT TATTA TCTGAATATG ACTTCCCTGG ScGmSJa" 
CCTGTAATCG CTGGATCTGC TTTAAAAGCA TTAGAAGGCG tStS?!?™ 
ATCATGGAAT TAATGGACGC AGTTGATGAG 
c S™ AGAACG TGATTCTGAC AAACCATTCA TGATGCCAGT TGAGGACGTA 
^™ TCA CT °GTCGTGG TACAGTTGCA ACTGGACGTG SSftGCGTOG* 
ACAAGTTAAA GTTGGTGAAG AAGTTGAAAT CATTGGTTTA ACTGaSKS 
CAGCAAAAAC TACAGTTACA GGTGTAGAAA TGTTCCGTAA AtSS^SSS 
TACGCTGAAG CTGGAGATAA CATCGGTGCT TTATTa5gTC OTgSc?^ 

o SSSSiSE caacgtggac aagtattagc taaaccagg? SSSSS 

0 CACATACTAA ATTCAAAGCT GAAGTTTACG TATTA tttis a^VVo^^S 
SSSS™* CTCCATTC TT CAC?Al??A? SSSct 
TACAACTGAC GTAACTGGTG TAGTTAACTT ACCAGAArrT £^^.££2 
TAATGCCTGG AGATAACATC GAAATGaS TTCTCCAATC 



35 



2) INFORMATION FOR SEQ ID NO: 184 

3 0 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus cohnii 
40 (B) STRAIN: DSM 20260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO- 184 



TTCTTAAACA AA5iiaACA? 55SSS^? SJSSS? tIS?SS 100 

2SSKSSJ Eggg SS SE IIS *" 

=o SKSSSS - 

=s isssss US L s? ~ - 

AGATTCAAGC AAAACAAcfS SSiSSJS 450 

55 ssssss 2ss sss izsii sss 

ssssss ?gffisa Si ssssgs g« 

ttccgtacta SSES££ aS?SSSS JS?£SS at 70 ° 

so SS2S5SS~SS 5SS 

Q "3 ^ 
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2) INFORMATION FOR SEQ ID NO: 



185 



10 



20 



25 



35 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 699 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

1S (-i)^ GIN ^SSf- Staphylococcus epidermidis 
(B) STRAIN: CSG 2 69 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185 
ATCTGCTGCT GACGGTCCAA TGC^CAAAC TCGTGAACAC ™TTAT 
CACGTAACGT TGGTGTACCA ^ATTAGTTG Jg«TT 
ATGGTAGA.CG ACGAAGAATT ^TTAGAATTA CCT g-jAATCGCTG 

CTTATTAAGC GAATATGACT CTGAATACGA ACAAAAAATC 

gttctgcatt aaaagcatta ^aggcgatg ctgaata CAG ^ CGTGh 

TTAGACTTAA TGCAAGCAGT JGATGATTAC Al i£ TCAATCACTG 

TTCTGACAAA CCATTCATGA TGCCAGTTGA JJACGTA QTT 
GTCGTGGTAC TGTTGCTACA G^GTGTTG AACGg 

GGTGAAGAAG TTGAAATCAT CGGTATGCAC J^lAOOCT GAAGCTGGTG 

30 TACTGGTGTA GAAATGTTCC GTAAAiiA GGTACAACGT 

ACAACATCGG TGCTTTATTA JGTGGTGTTG ^™ CAAAATTCAA 
GGTCAAGTAT JAGCTGCTCC TGGTTCTATT ACACCACA^ 

SSSSSS CTA^CCA Sa^SaTT TCCGTACTAC TGACGTAAC 



2) INFORMATION FOR SEQ ID NO: 186 

40 (i) SEQUENCE CHARACTERISTICS: 

* (A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINA^URCE : haemolyticU s 
50 ( B ) STRAIN: ATCC 29970 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186 

C =S? SSSSS 3SS3S sssssss sskes 

55 HIE 1=1 S55g£ SSS 

TGAAATGGAA fTACGTGACT TAiiJi aaqctTTAGA AGGCGATGCT 

ATGTACCTGT AAJCGCTGGT JCAGCATTAA TQ ATGACTACA T 

60 T^AC?C^ S^gS ESJSKcC ATTCATGATG CCAGTTGAGG 

243 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
699 



50 
100 
150 
200 
250 
300 
350 
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ACGTATTCTC AATCACTGGT 
CGTGGGCAAA TCAAAGTTGG 
CACTTCTAAA ACAACTGTTA 
ACTACGCTGA AGCTGGTGAC 
5 CGTGAAGACG TACAACGTGG 
ACCTCACACA AAATTTAAAG 
GTGGAC GTCA CACTCCATTC 
CGTACTACTG ACGTAACTGG 
GGTTATGCCT GGCGACAACG 
10 TCGCGATTGA AGACGGTACT 



CGTGGTACTG TTGCTACAGG 
TGAAGAAGTT GAAATCATTG 
CTGGTGTAGA AATGTTCCGT 
AACATCGGTG CATTATTACG 
TCAAGTATTA GCTGCTCCAG 
CAGACGTATA CGTTTTATCT 
TTCACAAACT ATCGTCCACA 
TGTTGTTAAC TTACCAGAAG 
TTGAAATGAC AGTAGAATTA 
CGTTTCTCA 



CCGTGTTGAA 4 00 

GTATC CATGA 4 50 

AAATTATTAG 500 

TGGTGTTGCT 550 

GTTCAATCAC 600 

AAAGACGAAG 650 

ATTCTATTTC 70 0 

GTACTGAAAT 750 

ATCGCTCCTA 800 
829 



15 



2) INFORMATION FOR SEQ ID NO: 187 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 705 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 < A > ORGANISM: Staphylococcus warneri 

(B) STRAIN: CSG 123 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187 

30 ^^ CTCG TGAACACATT CTTTTATCAC GTAACGTTGG TGTACCAGCT c n 
TTAGTTGTAT TCTTAAACAA AGTTGATATG GTAGACGACG AAGAATTATT inn 
^TTAGTA GAAATGGAAG TTCGTGACTT ATTATCTGAA TATG^StcJ lln 
CAGGTGACGA CGTACCTGTA ATCGCTGGTT CAGCATTAAA AGCTTTAGAA 
GGCGACGAAA AATACGAAGA AAAAATCTTA GAATTAATGC ^gSg™ lln 
TGACTACATT CCAACTCCAG AACGTGATTC TGACAAACCA TTcSSES 
GA £^ GAGGA CGTATTCTCA ATCACTGGTC GTGG^Jot lln 
^TGTTGAAC GTGGTCAAAT CAAAGTTGGT GAAGAAGtJg AAA^Sg 
TTTACATGAC ACTTCTAAAA CAACTGTTAC TGGTGTAGAA 

AGTTATTAGA CTACGCTGAA GCTGGTGACA ACATCGGTGC iSISSS 1 5 ° 

EK™™ GTGAAGACG * ACAACGTGGT cSSaSaS 5 °° 
l!n^ TACA G CACATACAA AATTCAAAGC GGAAGTTTAC mSSSS 
™? GAAGG TGGACGTCAC ACTCCATTCT TCAGTAACTA CCGcSIS 
?SS TTTCC GTACTACTGA CGTAACTGGC GTToSSSi SSSSSS 



35 



40 



550 
600 
650 
700 

45 705 



2) INFORMATION FOR SEQ ID NO: 188 

50 (i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 678 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 
60 (B) STRAIN: CSG 23 7 

244 
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15 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: l.B 

TTTTATCACG T^T^CT ggggX =S 100 

sssss S ssssss essk 

SS3SS iSSSSS og^ggc cggo^j III 

ATTTGACTCT GATAAAC CAT TCATGATGCC ^Gl^ TGGTCAAATC 350 

?CA??GG?CG TGGTACTGTT GCTACAGGTC ^GTTGAACG ^ 400 

AAAGTTGGTG AAGAAGTTGA AATTATTbUi ^ TACGCTGAAG 450 

aATTGTTACT GGTGTAGAAA TGTTCCGTAA ATiAiij TGAAGATGTA 500 

SSgAcS CATCGGTGCT ™™™ ££SSS CTCACACAAA 550 

CAACGTGGTC AAGTATTAGC TGCTCCAGGT GGACGTCATA 600 

SSSSSS SSaaSS cScScAAT TCTATTTCCG TACTACTGAC 650 

otSctgoto ttgttaactt ACCAGAAG 



2) INFORMATION FOR SEQ ID NO: 18 9 

( i ) SEQUENCE CHARACTERISTICS: 
( A Y LENGTH: 668 bases 
in) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE: Genomic DNA 



30 



(vi) ORIGINAL^URCE: staphyiococcus haemolyticus 
( B ) STRAIN: CSG 33 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189 

ACCAGCATTA GTAGTATTCT TAAATA^GT TGACATGGTT GACGAT^G 
SggSS SSSSSS S|C OCTGGTTCAG CATTAAAAGC 200 

40 SSgSSgc gatgctcaat acgaagaaaa aatcttagaa ^ 2 50 

CAGTTGATGA CTACATTCCA ACTCCAGAA^ ^ TCGTG qTACTGTTGC 300 

Sgatgccag ttgaggacgt attctcaatc actggtcgt^ 350 
?acag3ccgt gttgaacgtg gtcaaatcaa agttggtgaa gaa ^ tq 400 
?cStggtat ccatgacact tctaaaacaa ctgttactgg gcatt 45 o 

45 TTCCGTAAAT TATTAGACTA CGCTGAAGCT ^GACAA^ gtattagctg 500 

ss^ ss^ ssssa ssssss ss 

SEEKS iSSSSg SSSSS SSSSSSt. GTTAACTTAC 650 
50 CAGAAGGTAC TGAAATGG 



2) INFORMATION FOR SEQ ID NO: 



190 



55 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



245 
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10 



15 



20 



(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE • 

si SET^r r yi ™ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190 

SSggS SEEKS ttagaattag " GAAATGGA 

taatcgctSS SsSSS? iiSSiSS SSSS™ «™accra i 

GAAAAAATCT TAGAATtSt SJSS? 3 *" 30 TCAA TACGAA 150 

AGAACGTGAT TCTGACAAAC S^SXJ GATGATTACA TTCCAACTCC 200 

CAATCACTGG TCGTGGTACT JESS?™* GA CGTATTCT 25S 

ATCAAAGTTG GTOuESS ACGTGGTCAA too 

AACAACTGTT ACTGGTGTAG AAATCTTC?? Sa^™ ACACTTCTAA 350 

AAGCTGGTGA CAACATTGCT 33SS3SS SS* 06 ™ 4 °° 

GTACAACGTG GTCAAGTATT AGCTGCTr™ SSS^JT 30 TCGTG AAGAC 4 50 

AAAATTTAAA <X3U»SKi JSJStoS CACCTCAC AC 500 

ACACTCCATT CTTCACAAAC tSSJSE SJJSSg gS^^ 550 



50 



25 2) INFORMATION FOR SEQ ID NO: 191 

(i) SEQUENCE CHARACTERISTICS • 
(A) LENGTH: 828 bases 
3n TYPE: Nucleic acid 

J ° C STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE : Genomic DNA 
35 (vi) ORIGINAL SOURCE- 

S) ^Tl^gfiZr™*" subsp. * omijlis 

4Q (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191 

SS %Z3£3£ SSSSS S 0 *^™ ccacaaactc 

TTCTTAAACA AAGTTGACAT GTGTACC AGC ATTAGTAGTA X 00 

TGAAATGGAA GTj£S33? SSSgS? ESSE*** TAGAA TTAGT Jgg 

45 ACGTACCTGT AATCGCTGGT TCaSS?? A * AC £ ACTTC CCA GGTGACG 200 

CAATACGAAG AAAAAATCTT AGaSJaI^ -a™™^ AGG CGATGCT 250 

TCCAACTCCA GAACGTGACT CTGATAAAPP ^S^* 0 ATGATTATAT 360 

ACGTATTCTC AATCACTGGT CCTG^TA^SS 5SS^ TC CCAGT TGAGG 350 

CGTGGTCAAA TCAAAGTTrr t™^* AC I G ^TGCTACAGG CCGTGTTGAA Inn 

50 AACTTCTAAA ACAACTGTTA CTGGTOTAGA ajTrTrn^^* G TATCAAAGA Jgg 

ACTACGCTGA AGCTGGTGAC SStcS^ ™^ GGT ^TTATTAG 500 

CGTGAAGATG TACAACGTGG TCAActSJ? fJSSJS 03 TGG TGTTGCT §50 

ACCTCACACA AAATTCAAAG CAGACGTAT^ ™^ CCAG GTTCAATTAC S0 0 

GTGGACGTCA TACTCCATTC SSS^nS CGTTTTATCA AAAGATGAAG 650 

55 CGTACTACTG ACGTAACTGG wSSSJS JSSSS* 01 ATTCTATTTC 70S 

GGTAATGCCT GGTGACAACG TTOuSESiS 1™™°^° ^TACTGAAAT 750 

TCGCGATTGA AGACGGTACT CGT^£tC AGTAGAA TTA ATCGCTCCTA 80S 



60 
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2) INFORMATION FOR SEQ ID NO: 192 



10 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warnen 

(B) STRAIN: ATCC 35982 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192 



ATGGTC CAAT 
GGTGTACCAG 
CGAAGAATTA 

2 0 AATATGACTT 
AAAGC TTTAG 
GCAAGCAGTT 
CATTCATGAT 
GTTGCTACAG 

25 TGAAATCATC 
AAATGTTCCG 
GCTTTATTAC 
AGCTGCTCCT 
ACGTTTTATC 

30 



GC CAC AAACT 
CTTTAGTTGT 
TTAGAATTAG 
CCCAGGTGAC 
AAGGCGACGA 
GATGACTACA 
GCCAGTTGAG 
GCCGTGTTGA 
GGTTTACATG 
TAAGTTATTA 
GTGGTGTTGC 
GGTTCAATTA 
TAAAGACGAA 



CGTGAACACA 
ATTCTTAAAC 
TAGAAATGGA 
GACGTACCTG 
AAAATACGAA 
TTCCAACTCC 
GACGTATTCT 
ACGTGGTCAA 
ACACTTCTAA 
GACTACGCTG 
TCGTGAAGAC 
CACCACATAC 



TTCTTTTATC 
AAAGTTGATA 
AGTTCGTGAC 
TAATCGCTGG 
GAAAAAATCT 
AGAACGTGAT 
CAAT CACTGG 
ATCAAAGTTG 
AACAACTGTT 
AAGCTGGTGA 
GTACAACGTG 
AAAATTCAAA 



ACGTAACGTT 50 

TGGTAGACGA 100 

TTATTATCTG 150 

TTCAGCATTA 2 00 

TAGAATTAAT 250 

TCTGACAAAC 3 00 

TCGTGGTACT 350 

GTGAAGAAGT 4 00 

ACTGGTGTAG 4 50 

CAACATCGGT 500 

GTCAAGTATT 55 0 

GCGGAAGTTT 600 
620 



2) INFORMATION FOR SEQ ID NO: 193 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 92 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

45 (B) STRAIN: CSG 170 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193 

CCAGCATTAG TAGTATTCTT AAACAAAGTT GACATGGTTG ACGATGAAGA 50 

50 ATTATTAGAA TTAGTTGAAA TGGAAGTACG TGACTTATTA TCTGAATACG 100 

ACTTCCCAGG TGACGACGTA CCTGTAATCG CTGGTTCAGC TTTAAAAGCT 15 0 

TTAGAAGGCG ATGCTCAATA CGAAGAAAAA ATCTTAGAAT TAATGCAAGC 2 00 

AGTTGATGAT TATATTCCAA CTC CAGAACG TGACTCTGAT AAACCATTCA 250 

TGATGCCAGT TGAGGACGTA TTCTCAATCA CTGGTCGTGG TACTGTTGCT 300 

55 ACAGGCCGTG TTGAAC GTGG TCAAATCAAA GTTGGTGAAG AAGTTGAAAT 3 50 

TATTGGTATC AAAGAAACTT CTAAAACAAC TGTTACTGGT GTAGAAATGT 400 

TCCGTAAATT ATTAGACTAC GCTGAAGCTG GTGACAACAT CGGTGCTTTA 450 

TTACGTGGTG TTGCTCGTGA AGATGTACAA CGTGGTCAAG TATTAGCTGC 500 

TCCAGGTTCA ATTACACCTC ACACAAAATT CAAAGCAGAC GTATACGTTT 550 

6 0 TATCAAAAGA TGAAGGTGGA CGTCATACTC CATTCTTCTC TAACTATCGT 60 0 

247 
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CCACAATTCT ATTTCCGTAC TACTGACGTA ACTGGTGTTG TTAACTTACC 650 
AGAAGGTACT GAAATGGTAA TGCCTGGTGA CAACGTTGAA AT 692 . 



2) INFORMATION FOR SEQ ID NO: 194 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 684 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

(B) STRAIN: CSG 36 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194 

CATTCTTTTA TCACGTAACG TTGGTGTACC AGCATTAGTA GTATTCTTAA 50 

ACAAAGTTGA CATGGTTGAC GATGAAGAAT TATTAGAATT AGTTGAAATG 100 

GAAGTACGTG ACTTATTATC TGAATACGAC TTCCCAGGTG ACGACGTACC 150 

25 TGTAATCGCT GGTTCAGCTT TAAAAGCTTT AGAAGGCGAT GCTCAATACG 200 

AAGAAAAAAT CTTAGAATTA ATGCAAGCAG TTGATGATTA TATTCCAACT 250 

CCAGAACGTG ACTCTGATAA AC CATTCATG ATGCCAGTTG AGGACGTATT 3 00 

CTCAATCACT GGTCGTGGTA CTGTTGCTAC AGGCCGTGTT GAACGTGGTC 350 

AAATCAAAGT TGGTGAAGAA GTTGAAATTA TTGGTATCAA AGAAACTTCT 400 

30 AAAACAACTG TTACTGGTGT AGAAATGTTC CGTAAATTAT TAGACTACGC 450 

TGAAGCTGGT GACAACATCG GTGCTTTATT ACGTGGTGTT GCTCGTGAAG 500 

ATGTACAACG TGGTCAAGTA TTAGCTGCTC CAGGTTCAAT TACACCTCAC 550 

ACAAAATTCA AAGCAGACGT ATACGTTTTA TCAAAAGATG AAGGTGGACG 600 

TCATACTCCA TTCTTCTCTA ACTATCGTCC ACAATTCTAT TTCCGTACTA 650 

3 5 CTGACGTAAC TGGTGTTGTT AACTTAC CAG AAGG 684 



40 



2) INFORMATION FOR SEQ ID NO: 195 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 685 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 
50 (A) ORGANISM: Staphylococcus hominis 

(B) STRAIN: CSG 6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195 

55 ACCAGCATTA GTAGTATTCT TAAACAAAGT TGACATGGTT GACGATGAAG 50 

AATTATTAGA ATTAGTTGAA ATGGAAGTAC GTGACTTATT ATCTGAATAC 100 

GACTTCCCAG GTGACGACGT ACCTGTAATC GCTGGTTCAG CTTTAAAAGC 150 

TTTAGAAGGC GATGCTCAAT ACGAAGAAAA AATCTTAGAA TTAATGCAAG 2 00 

CAGTTGATGA TTACATTCCA ACTCCAGAAC GTGACTCTGA TAAACCATTC 250 

60 ATGATGCCAG TTGAGGACGT ATTCTCAATC ACTGGTCGTG GTACTGTTGC 3 00 

248 
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> • * » 

TACAGGCCGT GTTGAACGTG GTC^CAA AGTTGGTGAA 
TTATTGGTAT CAAAGAAACT JCTAAAACAA ^ TCGGTGCTTT 
TTCCGTAAAT TATTAGACTA CGCTGAAGCT £ GTATTAGCTG 
ATTACGTGGT GTTGCTCGTG AA^JGTACA CGTATACGTT 
CTCCAGGTTC AATTACACCT CACACAAAA1 J CTAACTATCG 
SSSSSS SSE5 AACTGGTGTT GTTAACTTAC 
SSSSS Saaatggta ATGCCTGGCG ACAAC 



2) INFORMATION FOR SEQ ID NO: 196 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 611 bases 
i:> (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: Genomic DNA 

<Vi) X^gS? ; Staphylococcus hominis 

(B) STRAIN: CSG 62 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196 

GACTTATTAT CTGAATACOA CTTCCCAGGT GACGACGTAC C^TAATCGC 

SK2K SSS SSSKk GAGGACGTAT S33K 
GACTCTGATA AACCATTCAT GATGCCAGTT CAAATCAAAG 
TGGTCGTGGT ACTGTTGCTA CA^CGTGT ™^ TAAAACAACT 
TTGGTGAAGA AGTTGAAATT ATTGGTATCA *M*™= CTGAAGCTGG 
GTTACTGGTG TAGAAATGTT JCGTAAAT1A TGCTCGTGAA GATGTACAAC 
35 TGACAACATC GGTGCTTTAT TACGTGGTGT TGCTCGi ^ca^tIC 
GTGGTCAAGT ATTAGCTGCT JCAGGTTCAA TCACACU GTCATACTCC 
AAAGCAGACG JATATGTTTT ATCAAAAGAT ACTGACGTAA 

SSSSSS SSSSEw aaatggtaat gcctggcgac 

4 0 AACGTTGAAA T 



25 



30 



45 



2) INFORMATION FOR SEQ ID NO: 197 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 ( D ) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



55 



<Vl,< ?S GI M™Sf Staphylococcus lusduncsis 
(B) STRAIN: ATCC 43809 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197 
60 CGGCGGTATC TTAGTAGTTT CTGCTGCAGA TGGTCCAATG CCACAAACTC 



350 
400 
450 
500 
550 
600 
650 
685 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
611 



50 
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20 2) INFORMATION FOR SEQ ID NO: 198 

( i ) SEQUENCE CHARACTERISTICS • 
(A) LENGTH: 690 bases 
J B > TYPE: Nucleic acid 
25 (O STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 



35 



40 



50 



2) INFORMATION FOR SEQ ID NO: 199 

55 (i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 723 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 



100 
150 
200 



^2^ CACAT T CTTTTATCA CGTAACGTTG GTGTGCCAGC ATTAGTAGTA 
TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAaSSt 
^^ GGAA GTTC GTGATT TATTAAC TGA ATATGACTTC CC^^ScG 
ATGTGCCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAra arr^^n 

JSSS™ CTAAAAT "T AGAATTAATG ^gSotg J£2S^ Itn 
TCCAACTCCA GAACGTGACT CTGACAAACC ATTCATGATG cSrSrSS 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG cSSSSS? 35 ° 

CGTGGT CAAA TCAAAGTCGG TGAAGAAGTT GAAATTATTG SSSSSff 4 °° 

TACTACTAAA ACAACTGTTA CTGGTGTAGA StS??SSt SiSSSS 45 ° 

ACTACGCTGA AGCTGGTGAC AACATCGGTG 2SS5SS 55SS™?? SS 

CGTGAAGATG TACAACGTGG ACAAGTATTA GCTGCTCCAG 55 ° 

ACCTCACACT AAATTTAAAG CTGACGTATA TOTTTTATOT SSSJS?^ 6 °° 

szssss ssss sssss illl esses «• 

ssgsss sggssg sgjgr ssss sssssss 750 



800 
828 



(A) ORGANISM: Staphylococcus saprophytics 

(B) STRAIN: ATCC 35552 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198 

AGTAGTATCT GCTGCTGATG GCCCAATGCC ACAAAPTrv-T 
TTTTATCACG TAACGTTGGT GTTCCAGCAT iSSJJSSS 

GTTGACATGG TTGACGATGA AGAATTATTa n^l^r^l CTTAAACAAA 100 

il=iS™ilI== is 

AACAACTGTT ACTGGTGTAG AA^SS? ™S5S£? AATCAAGCAA 450 

AAGCTGGTGA CAACATTGGT SJ2SJSE£ GACTACGCTG 500 

GTACAACGTG GTCAAGTTTT AGCTGCTf"CT SSSSJH? ACGTG ATGAC 550 

AAAATTCAAA GCGGATGTTT JSSSSS? SSJSSSS' ^^CATAC 600 
ATACACCATT CTTCAClSS TACOTCCCAC XSJSSJi GOTOOTCOTC 



50 



650 
690 



250 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi ) ORIGINALJOURCE : staphylococcus saproP hyticu S 
5 ( B ) STRAIN : CSG 83 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199 
GCATTAGTTG TATTCTTAAA CAAAGT-AC ATGGTTGACG ATGJACAATT 
10 ATTAGAATTA GTAGAAATGG AAGTTCGTGA TTT ^ 

TCCCAGGTGA CGATGTACCT GTAATCOOl TTAGACTTAA TGCAAGCTGT 
GAAGGCGACG CTGACTATGA gAAAAAATC TTCTGACAAA CCATTCATGA 
TGATGACTTC ATTCCAACAC CAGAA^ GTCGTGGTAC TGTTGCTACA 
TGCCAGTTGA GGACGTATTC TCAATCACTJ ( ^ T CGAAATCAT 

15 GGCCGTGTTG AACGTGGTCA ^CAAAGTC GGT GTAGAAATGT 
CGGTATGCAA GAAGAATCAA G£AAAACAAL £ TGGTGCATTA 
TCCGTAAATT ATTAGACTAC ^TGAAGCTG ^ TTTTAGCTGC 
TTACGTGGTG TTTCACGTGA TGACGTACAA CGTG GTTTACGTTT 
TCCTGGTACT ATTACACCAC JTACAAAATT UW£ q taactaccgc 

20 TATCTAAAGA TGAAGGTGGT CGTCATACAC Q XTAACTTACC 

SSSSSS tSSa TAACGTTGAA ATGGATGTTG 

AATTAATTTC TCCAATCGCT ATT 



25 



2) INFORMATION FOR SEQ ID NO: 200 



(i) SEQUENCE CHARACTERISTICS 
30 (A) LENGTH: 697 bases^ 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
723 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

,vi) ^""EsKS? ' staphylococcus ea p~phyticus 

(B) STRAIN: CSsa 18 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200 

CGTTGGTGTT CCAGCATTAG TTGTATTCTT AAACAAAGTT SggSS 
ACGATGAAGA ATTATTAGAA "AGTAGAAA ^ C TGGTTCTGC 

45 AGCGAATATG ACTTCCCAGG TGACGATGTA CCT^^ ATCTTAGACT 
ATTAAAAGCT ™GAAGGCG ACGCTGAC1A CG TGATTCTGAC 

TAATGCAAGC TGTTGATGAC TTCATTCCAA u tca CTGGTC GTGG 

AAACCATTCA TGATGCCAGT TGAGGACGTA TTCTCAA GTCGGTGAAG 
TACTGTTGCT ACAGGCCGTG TTGAACGTGG TC^ATCAAA 

50 AAATCGAAAT CATCGGTATG CAAGAAGAAT CA££ CTGGTGACAA 
GGTGTAGAAA TGTTCCGTAA ATTATTAGAC TAg CAACGTGGTC 
CATTGGTGCA TTATTACGTG GTGTTTCACU QM ATTCAAAGCG 

AAGTTTTAGC TGCTCCTGGT ACTATTACAC CACATACAAA CTT 

Satgtttacg ttttatctaa agatgaaggt g^gtcata ggtg 650 

55 SSSSSS SSSSS SSSCCTGG CGATAAC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 



60 



2) INFORMATION FOR SEQ ID NO: 201 

251 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
10 ( vi ) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus sciuri subsp. sciuri 

(B) STRAIN: ATCC 29060 



15 



20 



35 

2) INFORMATION FOR SEQ ID NO: 202 

( i ) SEQUENCE CHARACTERISTICS • 
< A ) LENGTH: 831 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201 
CGGCGGTATC TTAGTAGTAT CTGCTGCTGA CGGTCH JX A Tr» Pn^^nmn 

™S CACAT tctttt *tca cgtaa?SSS SSSSJS? 

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT SJJ?S^^ 100 

tgaaatggaa gttcgtgact tattatctga atatSJS?? 150 

ACGTTCCTGT AATTGCTGGT TCAGCAtJS aSSSS 2 °° 

5K2S5£? ACAAAA ™ GGAATTAATG SJSSSSJS JSSJSSS nn° 

CCCAACTCCA GAACGTGACT CTGACAAACC ATTCATGATP ^ptS 300 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTgSIpJJS ^2^ GAGG 350 

CGTGGTCAAA TCACTGTTGG toSS^S SS^2g SSSSSJf 4 °° 

AGAATCTTCT AAAACAACTG TAACTGGTGT TGAAA^C SSJiSS 45 ° 

ss sssss =s Si" »«° 

2SSSSS ss ssssi* IS ililF s«° 

TTCCGTACTA CTGACGTAAC toSSSSJJ JSS iSSSKSSI 7 °° 

^GGTTATG CCTGGCGACA ACGTTGAAAT SS£S5S£ JiSSSSSS Inn 

CAATCGCTAT TGAAGACGGT ATCGTTTCTC AATCA -TAATTTCAC 800 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: ATCC 27836 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202 

^tccaatg CCACAAACTC 

55 2S5SSS SSSSS iSS 

jssss jess E Si EE! F 332 - 

tccaactcca gaacgtgatt ctgS™S JtoSSS ?o^ acat 300 

ACGTATTCTC AATCACTGGT CGTGGTACTG SSISn^ S™I TGAGG 350 
CGTGGTCAAA TCAAAGTTGG TgSSXS?? SSS^Jgg ggSSSJ 400 



50 



450 



252 
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10 



~ ™E liii S iill 
slill EsE His see asss 
Sll S sees = sssss 

TCGCGATTGA AGACGGTACT CGTTTCTCAA C 
2) INFORMATION FOR SEQ ID NO: 203 



(i) SEQUENCE CHARACTERISTICS : 
15 (A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warnen 

(B) STRAIN: CSG 50 



25 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203 

S3S5SS sssss sssss SSSSS 

EEs sssss sssss sssss 
™ii illl ssss K ™ 

H i s^is ssssss ™" ~ s? = 
slss sssssss ss ssssss 
iill sssl 5SSS5 ssssss asss 

S??2SScA AAATTCAAAG CGGAAGTTTA CGTTTTATCT AAAGACGAAG 
^Stpa PACTCCATTC TTCAGTAACT ACCGCCCACA ATTCTATTTC 
GTGGACGTCA JACTCCATTC TTACCAGAAG GTACTGAAAT 

ggSSScc? Sg?g^Scg ??5Satgac AGTAGAATTA ATCGCTCCTA 
TCGCGATTGA AGACGGTACT CGTTTCTCA 



2) INFORMATION FOR SEQ ID NO: 2 04 

50 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



60 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: , 

(A) ORGANISM: Bifidobacterium longum 

(B) STRAIN: ATCC 15707 

253 



500 
550 
600 
650 
700 
750 
800 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204 



TGGCGCTATC CTCGTTGTGG CCGCCACCGA CGGCCCGATG GCCCAGACTC 
5 GCGAGCACGT GCTGCTCGCC CGTCAGGTTG GCGTTCCGAA GATCCTCGTC 
GCCCTGAACA AGTGCGACAT GGTCGACGAT GAAGAGCTCA TcSScTOOT 
CGAAGAAGAG GTCCGCGACC TCCTCGACGA GAACGGC TTC 
GCCCGGTCAT CCACACCTCC GCTTACGGTG CTCTGCACGA CGACGCTCCG 
GACCACGAGA AGTGGGTCCA GTCCGTTAAG GACCTCATGG ACGCTGTcS 
10 CGACTACATC CCGACCCCGG TTCACGACCT GGACAAGCCG TTOCtStOC 
CGATCGAGGA CGTCTTCACC ATCTCCGGCC GTGGTACCGT TCTCACCTrS 
CGTGTCGAGC GTGGCCAGCT GGCCGTCAAC ACCCCGGTCG StcctSS 
TATCCGTCCG ACCCAGCAGA CCACCGTCAC CTCCATCGAG ACCtSScIS 
lq AGACCATGGA CGCCTGCGAG GCTGGCGACA ACACCGGTCT rcSS^GT 
15 GGTCTCGGCC GTGACGATGT CGAGCGTGGC CAGGTTGTGG COuESSS 
SSSSSS 0 CCGCACACCA AGTTCGAGGG CGAAGTcSE 
AGGACGAAGG CGGCCGTCAC TCGCCGTTCT TCTCCAACTA CCGTCCGrAT 
TTCTACTTCC GCACCACCGA CGTCACCGGC GTCATCGAGC TOcSSSS 
CGTCGAGATG GTTCAGCCGG GCGACCACGC TACCTTCACC CTTGaStS 
20 TTCAGCCCAT CGCTATGGAG GAAGGCCTGA CCTTCGCTG GTTGAGCTGA £00 

8 3 9 



25 



2) INFORMATION FOR SEQ ID NO: 205 

( i ) SEQUENCE CHARACTERISE CS : 

(A) LENGTH: 754 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Stenotrophomonas maltophilia 

(B) STRAIN: CDC F3338 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205 

40 C £ GCGCGATC CTGGTGTGCT CGGCCGCTGA CGGCCCGATG CCGCAGACCC 

GTGAGCACAT CCTGCTGTCG CGCCAGGTCG GCGTGCCGTA CATCCTcSSr , nn 

TTCCTGAACA AGGCCGACAT GGTCGACGAC GCCGAGCTGC SSSSSS 
CGAGATGGAA GTGCGCGAAC TGCTGAGCAA GTACGAGTTC CCOQqSSJ ™2 
ACACCCCGAT CATCGCCGGT TCGGCCCGCC TGGOSCTGGA S^SSS 

45 AGCGACATCG GCGTGCCGGC CATCCTGAAG CTGGtSa2g SSSSSS? 

ctggattccg gagccggagc gtgcgatcga caagccg??? 

? GGAAGACGT GTTCTCGATC TCGGGCCGCG GCACCGTGGT SoMotSS Inn 

ATCGAGCGCG GCGTGATCAA GGTTGGCGAC GAAATCGAAA TCGTrr£S£ 4 2° 

CCGTCCGGTG CAGAAGACCA CCGTGACCGG CGTTGAAATG ^SSSJ f 5 ° 

50 TGCTGGACCA GGGTCAGGCA GGCGACAACG SSgcctSS ^2° 

ACCAAGCGTG ATGACGTCGA GCGTGGCCAG GTGCTGGCCA aSSSSS Inn 

GATCAAGCCG CACACCAAGT TCGAAGGCGA AGTGTACGTC CTotS? 0 ^ !2° 

^°^ GCGG CCGCCA CACC CCGTTCTTCA ACGGCTACCG nnn 

55 J AC ^ TCCGCA CCACCGACAT CACCGGCGCC GCTGCAC^ CGGAAg1S£ ?gj 

754 

2 ) INFORMATION FOR SEQ ID NO : 2 06 

S 0 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 ( A ) ORGANISM: Streptococcus acidominimus 

(B) STRAIN: ATCC 51726 



TCGTGCTATC CTTGTAGTAG CTTCAACTGA CGGACCAATG CCACAAACTC 
£?S3ScAT CCTTCTTTCA C GTCAAGTTG GTGTTAAAAA CCTTATCGTT 
SStgaaS AAGTTGACCT TGTTGATGAT GAAGAATTGC TTGAATTGGT 
tcaaaSSaa ATCCGTGACC TTCTTTCAGA atacgatttc CCAGGTGATG 
IS™?AGT TGTTCAAGGT TCAGCTCTTA AAGCG CTTGA AGGTGATTCA 
Siriir ATGTTATCAT GGAATTGATG TCAATCGTTG ACACATACAT 

tcc^Saac^a Stgacaaacc attgcttctt ccagtcgagg 

ISptattctc aatcactgga cgtggtactg ttgcttcagg acgtatcgac 
SSSSSKg ttaaagttaa tgacgaagtt gaaatcgttg gtatcaaaga 
SESKtctct aaagcagttg ttactggtgt tgaaatgttc cgtaaacaac 
???acgaagg tcttgctgga gataacgttg gtgttcttct tcgtggtgta 
™Sg?£atg aaatcgaacg tggtcaagtt cttgctaaac caggttcaat 

C^SScS ACTAAATTCA AAGGTGAAGT TTACGTTCTT TCTAAAGAAG 
^Sr^aCG TCACACTCCA TTCTTCGATA ACTACCGTCC TCAGTTCTAC 
TTCCGTACAA SgISSaAC TGGTTCAATC AAATTGCCAG AAGGTACTGA 
aat^taSg cctggtgata acgtaactat CGAAGTTGAG TTGATCCACC 
CAATCGCCGT TGAACAAGGT ACTACTTTCT CTATC 



35 2) INFORMATION FOR SEQ ID NO: 207 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 
40 (c) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactxae 

(B) STRAIN: ATCC 12403 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207 

50 CTATCCTTGT AGTTGCTTCA ACTGATGGAC CAATGCCACA AACT CGTGAG 50 
SStcS?? tt?cacgtca AGTTGGTGTT AAACACCTTA TCGTATTCAT 
Saacaaaott GACCTTGTTG atgatgaaga attgcttgaa TTGGTTGAAA 
??Saaattcg tgaccttctt tcagaatacg acttcccagg TGATGACCTT 

« rct^ScC AAGGTTCAGC TCTTAAAGCA CTTGAAGGCG ACGAAAAATA 250 

SgAaSS^C JtCATGGAAT TGATGAGCAC TGTTGATGAG TACATTCCAG 300 

Sc1S£aACG TGATACTGAC AAACCTTTAC TTCTTCCAGT TGAAGATGTA 350 

T^C?CAA^CA CTGGACGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 400 

TACTGTTCGT GTCAACGACG AAGTTGAAAT CGTTGGTATT AAAGAAGATA 450 

60 TCCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 500 



100 
150 
200 
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GAAGGTCTTG CAGGGGACAA 
TGATGAAATC GAACGTGGTC 
CACACACTAA ATT TAAAGGT 
GGACGTCATA CTCCATTCTT 
5 TACAACTGAC GTAACAGGTT 
TTATGCCTGG TGATAACGTT 
GCCGTAGAAC AAGGTACTA 



CGTTGGTGTT CTTCTTCGTG 
AAGTTCTTGC TAAACCAGGT 
GAAGTTTACA TCCTTTCTAA 
CAACAAC TAC CGTCCACAAT 
CAATCGAACT TCCAGCAGGA 
ACTATCGAAG TTGAATTGAT 



GTGTTCAACG 550 

TCAATCAACC 60 0 

AGAAGAAGGT 650 

TCTACTTCCG 70 0 

ACAGAAATGG 750 

TCACCCAATC 800 
819 



10 



2) INFORMATION FOR SEQ ID NO: 208 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 819 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii> MOLECULE TYPE: Genomic DNA 

20 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 12 973 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208 

CTATCCTTGT AGTTGCTTCA ACTGATGGAC CAATGCCACA AACTCGTGAG 
CACATCCTTC TTTCACGTCA AGTTGGTGTT AAACACCTTA TCotSJSS 
,n ^^ AGTT ^TTGTTG ATGATGAAGA ATTGCTTGAA TtSSSSI 

30 ^^ ATTCG TGACC TTCTT TCAGAATACG ACTTCCCAGG TGAtScCT^ 
n^^ ATCC AAGGTTCAGC TCTTAAAGCA CTTGAAGGCG ATGAA^S 
CGAAGACATC ATCATGGAAT TGATGAGCAC TGTTGATGAG TACATTCCAG 
AACCAGAACG TGATACTGAC AAAC CTTTAC TTCTTCCAGT JSSS^CTA 
TTCTCAATCA CTGGACGTGG TACAGTTGCT TCAGGACGTA TCGACCctS 

35 TACTGTTCGT GTCAACGACG AAGTTGAAAT CGTTGGTATT JSSSgJS 
TCCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAr 
GAAGGTCTTG CAGGGGACAA CGTTGGTGTT CTTCTTCGTG gJotJSSg 
^JS^™ GAA^TGGTC AAGTTCTTGC TAAACCAGGT TCAATCAACC 

An CACACACTAA ATTTAAAGGT GAAGTTTACA TCCTTTCTAA AGAAGAAGGT 

40 GGACGTCATA CTCCATTCTT CAACAACTAC CGTCCACAAT T^^S ° 
TACAACTGAC GTAACAGGTT CAATCGAACT TcSgCAgSJ J525^ 
^^GCCTGG TGATAACGTT ACTATCGAAG TTGAATTGAT tScC^TC 
GCCGTAGAAC AAGGTACTA i^alllmtc 8 00 

819 

45 

2) INFORMATION FOR SEQ ID NO: 2 09 

(i) SEQUENCE CHARACTERISTICS' 
50 (A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 13813 

6 0 



50 
100 
150 
200 
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350 
400 
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500 
550 
600 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209 
AGCTATCCTT GTAGTTGCTT ^CTGATGG ACCAATGCCA C^CTCGTG 

iSSSSS SSSSS5 ^SSS S£?£5E aattggttga 

ATGAACAAAG TTGACCTT^l ttTCAGAATA CGACTTCCCA GGTGATGACC 
AATGGAAATT CGTGACCTTC TTTCAGAAIA ^ CGATGAAAAA 

SSSSS actgttgatg agtacattcc 

™4S iiii sssss ssss ssssss 
-iiii EEi is sssss ssssa 
» £Si issss? sees ss 
IIEI iS ssss ssss sssss 
Si «S ssss ssss ssss 

TCGCCGTAGA ACAAGGTACT AC 



2) INFORMATION FOR SEQ ID NO: 210 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

35 (B) STRAIN: CDC 1073 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210 

«g»SS SSS5S iS 

40 ssssss ss™ lis SS^S o°o 

45 Hil ilii SI = as 

50 CAACGTGATG AAATCGAACG TGGTCAAGTT CTTGCTAAAC ^GTTCAA ^ 

CCTGGTGATA ACGTTACTAT CGAAGTTGAA TTGATTCACC 800 
55 CAATCGCCGT AGAACAAGGT ACTAC 



2) INFORMATION FOR SEQ ID NO: 211 

60 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Vi) ORIGINAL SOURCE: 
10 ( A ) ORGANISM: Streptococcus anginosus 

(B) STRAIN: ATCC 33397 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211 

15 GGAGCTATCC TTGTAGTAGC TTCAACTGAC GGACCAATGC CTCAAACTCG 
TGAACACATC CTTCTTTCAC GCCAAGTAGG TGTTAAATAC CTTATCGTCT 
TCATGAATAA AGTTGACTTG GTTGACGATG AAGAATTGCT TGAATTGGTT 
GAAATGGAAA TCCGTGACCT TCTTTCAGAA TACGATTTCC CAGGTGATGA 
AATCCCAGTT ATCCAAGGTT CAGCTCTTAA AGCTCTTGAA GGTGATGAAA 2"?fl 

2 0 AATATGAAGA CATCATCATG GAATTGATGG ATACTGTTGA TGAATACATT 3 00 

CCAGAACCAG AACGTGACAC TGACAAACCA TTGCTTCTTC CAGTTGAAGA. 3 SO 

TGTATTCTCA ATTACTGGAC GTGGTACTGT TGCTTCAGGA CGTATCGACC Ho 

GTGGTACTGT TAAAGTCAAC GACGAAGTTG AAATCGTTGG TATCCGTGAT 4^0 

GAAATCCAAA AAGCAGTTGT TACTGGTGTT GAAATGTTCC GTAAACAATT SOO 
25 GGACGAAGGT CTTGCTGGAG ATAACGTAGG GGTTCTTCTT CGTGGTATCC 
AACGTGACGA AATCGAACGT GGACAAGTTC TTGCTAAACC AGGTTCAATT 
CATCCACACA CTAAATTCAA AGGTGAAGTT TACATCCTTA CTAAAGAAGA 
AGGTGGACGT CATACTCCAT TCTTCAACAA CTACCGTCCT CAATTCTACT 

TCCGTACTAC AGACGTTACA GGTTCAATCG AACTTCCTGC AGGTACTGAA 750 

3 0 ATGGTAATGC CTGGTGATAA CGTAACAATC GACGTTGAAT TGATCCACCC 8 00 

AATTGCCGTA GAACAAGGAA CTACAT ^AlCCACCC 800 

82 6 



50 
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35 2) INFORMATION FOR SEQ ID NO: 212 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

4 0 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus Jbovis 

(B) STRAIN: ATCC 33317 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212 

TGGTGCTATC CTTGTAGTAG CTTCTACAGA TGGTCCAATG CCACAAACAC , n 
GTGAACACAT CCTTCTTTCA CGTCAAGTTG GTGTTAAACA CCTTATCotS inn 
TTCATGAACA AAGTTGACCT TGTTGATGAC GAAGAATTGC TTgSSgOT ill 
TGAAATGGAA ATCCGTGACC TTCTTTCAGA ATATGATTTC CCAGGTGATn onn 

55 ^^ CCTGT ^CCAAGGT TCAGCTCTTA aJScSSS aSJSSS °° 
CACTACGAAG ACATCATCAT GGAATTGATG AACACTGTAG ATGAATACAT 
TCCAGAACCA AAACGTGATA CTGACAAACC ATTGCTTCTT cSotcgSg 
n^Z* CTC AAT^TGGT CGTGGTACTG TAGCATCAGG ACG?a5?3J? 
CGTGGTACTG TTAAAGTCAA CGACGAAGTT GAAATCGTTG GTATCCGTG? 

60 CGACATCCAA AAAGCTGTTG TTACTGGTGT TGAaSStc 2gSSS too 

258 
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TTGATGAAGG TATCGCAGGG GATAACGTTG GTGTTCTTCT TCGTGGX^TC 550 

CAACGTGATG AAATCGAACG TGGTCAAGTT Lli ACTAAAGAAG 650 

CCACCCACAC ACTAAATTCA AAgTGAAGT JTACA1 tcaattctac 700 

AAGGTGGACG TCACACTCCA JJCTTCAACA ^ CAGGTACTGA 750 

5 SSSSSS SS 5SSSSS CGACGTTGAA TTGATTCACC 800 
CAATCGCCGT TGAACAAGGT ACTACAT 

10 2) INFORMATION FOR SEQ ID NO: 213 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 821 bases 
c ( B ) TYPE: Nucleic acid 

lb (c) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi)ORIGI^^ = strepI:ococous ^inosus (deposited as 
Streptococcus constellatus) 
(B) STRAIN: ATCC 27823 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213 
GCTATCCTCG TAGTAGCTTC JACTGATGGA CCAATGCCTC AAACTCGTGA 
ACATATCCTT CTTTCACGTC AAGTAGGTGT JAAATACCT^ AT "pGGTTGAA 
30 TGAACAAAGT TGACTTGGTT ^CGATGAAG £££££ GTGATGAAAT 
ATGGAAATCC GTGACCTTCT TTCAGJAT^C ^TTC^ QATGAAAAAT 
CCCAGTTATC CAAGOTTCAG CTCTTAAAGC gT^AA ATACATTCCA 
ATGAAGACAT CATCATGGAA JTGATGGATA £TGTT^ TCGAAGATGT 
GAACCAGAAC GTGACACTGA ^CCACTT CTTCTiU ATCGACCGTG 
35 ATTCTCAATC ACTGGACGTG gJACTGTTGC TTCAGGAC^ TCGTGACGAA 
GTACTGTTAA AGTCAATGAT ^AAGTTOAAA ^ AACAATTGGA 
ATCCAAAAAG CAGTTGTTAC TGGTGTTGAA ATGTTCCGTA AA ^ 
CGAAGGTCTT GCTGGAGATA ACGTAGGGGT TCTTCTTCG 
GTGACGAAAT CGAACGTGGA CAAGTTCTTG ^TAAAC 
40 CCACACACTA AATTCAAAGG TGAAGTTTA^ CCTCAA TTCTACTTCC 

TGGACGTCAT ACTCCATTCT TCAACAACTA CCGTCCTCAA 

GTAATGCCTG SSSSS ESSSKE GTTGAGTTGA TCCACCCAAT 
TGCCGTAGAA CAAGGAACTA C 
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45 



2 ) INFORMATION FOR SEQ ID NO : 214 

, 0 (i) SEQUENCE CHARACTERISTICS: 

* (A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



60 



(ii) MOLECULE TYPE: Genomic DNA 

f vi ) ORIGINAL SOURCE: 

(A) ORGANISM: streptococcus cncetus 

(B) STRAIN: ATCC 19642 

259 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214 

GCTATCCTTG TAGTAGCTTC TACAGACGGA CCAATGCCAC AAACTCGTCA 
ACACATCTTG CTTTCACGCC AAGTTGGTGT TAAGAGCCTT ATCOTCTtS 
TGAACAAGGT TGACTTGGTT GACGATGAAG AATTGCTTGA ATTGGTTGAA 
ATGGAAATCC GTGATCTTCT TTCAGAATAC GATTTCCCAG GTGATGATAT 
ESSSE SSSJSS? 2£™»2? CCTTGAAGGT GAtSS 



50 
100 
150 

ccgaagacaa gatcatgga^ ttgatggaca tcgtt^tca SJaStcS is 

10 SKS™* ™££S!™ CTTCTTcSS SS5£^ 35S 

400 
450 

15 gtgatgaaat cgaacgtggt caagtattgg ctgcacSS ESSSE E 
gtacaactga cgtaactggt tcaICtcgaat toccagc^g SSSKSS !H 

on ^^ TGCCTG GTGATAACGT TACTATCGAC GTTGAATTGA TcScCCAA? 8 n 
20 CGCTGTTGAA AAAGGTACTA C i«-«-ALL.caat 8 00 

821 



ATTCTCAATC ACTGGACGTG GTACTGTTGC TTCAGGACGT ATCGACCGTG 
GTACTGTTAA GGTCAATGAC GAAGTTGAAA TCGTTGGTAT CAAGGACGAA 
ATCCAAAAAG CGGTTGTTAC CGGAGTTGAA ATGTTCCGTA AACAAT?gSJ 
TGAAGGTCTT GCAGGGGATA ACGTTGGTGT GcScTTCGT 
GTGATGAAAT CGAACGTGGT CAAGTATTGG CTGCACCTGG TTCAATCCA? 
CCACACACTA AATTCAAGGG TGAAGTTTAC ATCCTTTCTA AAGATGAATC 
^GA^TCAC ACTCCATTCT TCAACAACTA CCGTCCaSg 



25 



2) INFORMATION FOR SEQ ID NO: 215 



(i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Streptococcus cristatus 

(B) STRAIN: ATCC 51100 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215 

4 0 TATCCTTGTA GTAGCTTCAA CTGACGGACC AATGCCACAA ACTCCTrArn 
iSSSSEF TTCACGTCAG GTTGGTGTTA SSS^S^ otSSSSS 
AACAAGATCG ACTTGGTTGA TGACGAAGAA TTGCTTGAAT TgSttSSJ? 
G ™ TCCGT GACCTCTTGT CAGAATACGA CTTCCCAGGT ScGATOT? 
CAGTTATCCA AGGTTCAGCT CTTAAAGCTC TTGAAGGTGA ScTA^ctIp 

4 5 GAAGACATCA TCATGGAATT GATGAACACT GTTGATGAGT AcSSSSSJ 
ACCAGAACGT GATACTGACA AACCTCTTCT TCTTCCAGTC GAAgSc^ 
TCTCAATCAC TGGTCGTGGT ACAGTTGCTT CAGGACGTAT cS^SSS 
JSJSES? 3 TCAAC ^TGA AATCGAAATC GTTGGTATCA 
qn ^ ( ^^ GCA GTTGTTACTG GTGTTGAAAT GTTC CGTAAA CAGCTtS^G 

aggggacaac gtaggtgtac ttcttcgtgg ta?c?Scot 

GATGAAATCG AACGTGGTCA AGTTATCGCT AAACCAGGTT C^5^£S 
ACACACTAAA TTCAAGGGTG AAGTTTACAT CCTTACTAAA GAAgSSgSS 
GACGTCACAC TCCATTCTTC AACAACTACC GTCCACAGTT 

J^TGACG TTACAGGTTC AATCGAACTT CCAGCAGGTA S^SS? lln 
55 AATGCCTGGT GATAACGTAA CTATCGACGT TGAGTTGATC CACCCAA??? Znn 
CCGTTGAACA AGGTACTCCT T L CCAATCG 8 00 

821 
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60 2) INFORMATION FOR SEQ ID NO: 216 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 92 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

( D ) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: , „. 

(A) ORGANISM: Streptococcus downei 

(B) STRAIN: ATCC 33748 



(xi) sayuiirjun — - 

AGTAGCTTCT ACT^TGGAC CAATGgACA AACTCGTGAA 

SSSS 1=1 3K3SS S= 

S IS ISa? sesss 

ATCATGGAAT JGATGGACAT gTTGA^ CGAAGATGTA TTCTCAATCA 

SKSSS ?|ggacgta tcgaccgxgg tactgxtaag 

JSSSSSS I KE gaaggxcxxg 

CAG ^n^ SESSSSS ?gS?CTGGT ??5aTTCACC CACACACTAA 
SESEES SESSSg ll-CXAA AGAAGAAGGT GGACGTCATA 

S5SSSSS SS SSSSSSS S^aI5| ttatgcctgg 

tg^SSS acWgacg ttgaattgat ccacccaatt gc 



3 5 2) INFORMATION FOR SEQ ID NO: 217 

( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 795 bases 

(B) TYPE: Nucleic acid 
40 (c) STRANDEDNESS: Double 

( D ) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: a ^tSam 

(A) ORGANISM: Streptococcus dysgalactrae 

(B) STRAIN: ATCC 43078 



(xi) SEQUENCE DESCRIPTION: SEQ ID. NO: 217 
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TGCAGGAGAC AACGTAGGTA TCCTTCTTCG TGGTGTTCAA CGTGACGAAA 
TCGAACGTGG TCAAGTTATT GCTAAACCAG GTTCAATCAA CCCACACACT 
AAATTCAAAG GTGAAGTATA TATCCTTTCT AAAGACGAAG GTGGACGTCA 
CACTCCATTC TTCAACAACT ATCGTCCACA ATTCTACTTC CGTACAACTG 
ACGTAACAGG TTCAATCGAA CTTCCAGCTG GTACAGAAAT GGTTATGCCT 
GGTGATAACG TGACAATCAA CGTTGAGTTG ATCCACCCAA TCGCC 



2 ) INFORMATION FOR SEQ ID NO: 218 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus equi subsp. egui 

(B) STRAIN: ATCC 9528 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218 

CGGAGCTATC CTTGTAGTTG CTTCAACTGA CGGACCAATG CCACAAACTC 
GTGAGCACAT CCTTCTTTCA CGTCAGGTTG GTGTTAAGCA CCTTATCGTG 
TTCATGAACA AGGTTGACCT TGTTGACGAT GAAGAATTGC TTGAGCTTGT 
TGAAATGGAA ATTCGTGACC TTCTTTCAGA ATATGATTTC CCAGGTGATG 
ACCTTCCAGT TATCCAAGGT TCAGCGCTTA AGGCTCTTGA AGGCGACAGC 
AAATACGAAG ATATCATCAT GGAATTGATG GATAC TGTTG ATTCATACAT 
TCCAGAACCA GAACGTGACA CAGACAAGCC ATTGCTTCTT CCAGTCGAGG 
ACGTATTCTC AATCACTGGA CGTGGTACTG TTGCTTCAGG ACGTATCGAC 
CGCGGTACTG TTCGTGTTAA CGACGAAATC GAAATCGTTG GTATCAGAGA 
CGAGATCAAA AAAGCAGTTG TTACTGGTGT CGAAATGTTC CGTAAACAGC 
TTGACGAAGG TCTTGCAGGG GACAACGTTG GTGTTCTTCT TCGTGGTGTA 
CAACGTGATG AAATCGAACG TGGTCAAGTT ATTGCTAAGC CAGGTTCTAT 
CAACCCACAC ACTAAATTTA AAGGTGAAGT ATATATCCTT ACTAAAGAAG 
AAGGTGGACG TCACACACCA TTCTTCAACA ACTATCGTCC ACAATTCTAC 
TTCCGTACTA CTGACGTAAC AGGTTCAATC GAGCTTCCAG CAGGTACAGA 
AATGGTTATG CCTGGTGATA ACGTGACTAT TGACGTTGAG TTGATCCACC 
CAATCGCCGT AGAACAAGGT ACTACATT LC 



2) INFORMATION FOR SEQ ID NO: 219 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus ferus 

(B) STRAIN: ATCC 334*77 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219 
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CGGTGCAATC CTTGTAGTAG CTTCTACAGA JgJgJJ™ ™ 
GTGAGCACAT CCTTCTTTCA CGTCAGGTAG falto TTGAATTGGT 
TTCATGAACA AAGTTGACTT ^TGACGAT ^ CCAGGTGATG 
5 TGAAATGGAA ATCCGTGACC TGCTTTCAGA ^CGCTTGA AGGTGATACT 
ACCTTCCAGT TATCCAAGGT TCAGCTC1 1A ATGAGTACAT 
GCTCAAGAAG ATGTTATCAT ATTGCTTCTT CCAGTCGAAG 

CCCAGAACCA GAACGTGATA CTGACAAACC ACGT AT CGAT 

ATGTATTCTC AATCACAGGT CGTGGTACTG JAGCTT GT AT C AAAG A 
10 CGTGGTACTG JAAGAGTCAA ^GAAGTT GAAATC^^ CGTAAACAAT 
CGAAATCACT AAAGCAGTTG TTALUOb^ GTGTGCTTCT CCGTGGTGTG 
TGGAC GAAGG TCTTGCTGGT GATAACGTTG ^GTGC^ cAGGTTCAAT 
CAACGTGATG AAATCGAACG TGGTCAAGTA Ti^ ACTAAAGAAG 
CAACCCACAC ACTAAATTTA AAGGTGAAGT acagttctac 
15 AAGGTGGACG TCATACACCA TTCTTCAACA ^ Q cAGGTACTGA 

2SSSSS JSSSS5 Sacgttgaa TTGATCCACC 

CAATCGCCGT TGAACAAGGT ACTAC 



2) INFORMATION FOR SEQ ID NO: 220 

( i) SEQUENCE CHARACTERISTICS : 
25 (A) LENGTH: 826 bases 

( B ) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus gordonn 

(B) STRAIN: ATCC 10558 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220 

CGGAGCTATC CTTGTAGTAG CTTCAACTGA TGGTCCTATG CCACAAACTC 

0 5KS£S SSS GGTTGACGAT GAAGAATTGC TTGAGTTGGT 
40 TTCATGAACA AAGTTGACT1 ATACGACTTC CCAGGTGACG 

TGAAATGGAA ATCCGTGACC TCTTGTCAGA AX A A GGTGACTCT 
ATCTTCCAGT TATCCAAGGT TCAGCTC11A aa ATGAGTACAT 
AAATATGAAG ATATCATCAT GGAATTGATG AA^^ CCAGTCGAAG 
CCCAGAACCA GAACGCGACA CTGACAAACC ATTGCTTCTT CCJOTC^ 
45 ACGTATTCTC AATCACTGGA CGTGGTACAG J££ GTATCAAAGA 
CGTGGTATCG JTAAAGTCAA TGACGAAATC cGTAAACAGC 
AGAAATCCAA AAAGCAGTTG TTAU^i GTGTGCTTCT TCGTGGTATC 
TTGACGAAGG TCTTGCAGGG ^C^CGTTG ^GTGCTTC CAGGTTCAAT 
CAACGTGATG AAATCGAACG TGGACAAGTT Ali^ ACTAAAGAAG 
50 CAACCCACAC ACTAAATTCA AAGGTGAAGT JJATATCCT ACAGTTCTAC 

SSI US « SSSSSS SS5SB 

CAATCGCCGT TGAACAAGGT ACTACT 



55 



2) INFORMATION FOR SEQ ID NO: 221 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 799 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus anginosus 

(B) STRAIN: ATCC 27335 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221 

TGTAGTAGCT TCAACTGACG GACCAATGCC TCAAACTCGT GAACATATCC 
TTCTTTCACG TCAAGTAGGT GTTAAATACC TTATTGTCTT CATGAACAAA 
GTTGACTTGG TTGACGATGA AGAATTGCTT GAATTGGTTG AAATGGAAAT 
CCGTGATCTT CTTTCAGAAT ACGATTTCCC AGGTGATGAT ATTCCAGTAA 
TCCAAGGTTC AGCACTTAAA GCTCTTGAAG GTGATGAAAA ATATGAAGAC 
ATCATCATGG AATTGATGAA TACTGT TGAT GAATATATTC CAGAACCAGA 
ACGTGATACT GACAAACCAT TGCTTCTTCC AGTCGAAGAT GTATTCTCAA 
TCACTGGACG TGGTACTGTT GCTTCAGGAC GTATCGACCG TGGTACTGTT 
AAAGT CAACG ATGAAGTTGA AATCGTTGGT ATCCGCGAGG AAATCCAAAA 
AGCAGTTGTT ACTGGTGTTG AAATGTTCCG TAAACAATTG GACGAAGGTC 
TTGCTGGAGA TAACGTAGGG GTTCTTCTTC GTGGTATC C A ACGTGACGAA 
ATTGAACGTG GACAAGTTCT TGCTAAAC CA GGTTCAATTC ATCCACACAC 
TAAATTCAAA GGTGAAGTTT ACATCCTTAC TAAAGAAGAA GGTGGACGTC 
ATACTCCATT CTTCAACAAC TACCGTCCTC AATTCTACTT CCGTACTACA 
GACGTTACAG GTTCAATCGA ACTTCCTGCA GGTACTGAAA TGGTAATGCC 
TGGTGATAAC GTAACAATTG ATGTTGAGTT GATCCACCCA ATTGCCGTA 



2) INFORMATION FOR SEQ ID NO: 222 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus macacae 

(B) STRAIN: ATCC 35911 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222 

TGGTGCTATT CTTGTAGTAG CTTCAACTGA CGGTCCAATG 
GTGAACATAT CCTTCTTTCA CGCCAAGTAG GTGTTAAAAA 
TTCATGAATA AAGTTGACTT AGTTGATGAT GAAGAATTGC 
TGAAATGGAA ATCCGTGATC TTCTTACAGA ATATGATTTC 
AACTTCCAGT TATCCAAGGT TCAGCACTTA AAGCTCTTGA 
AAGTACGAAG ATATTATCAT GGAATTGTTG GATACTGTAG 
CCCAGAACCA CAACGTGATA CTGACAAGCC ATTGCTTCTT 
ATGTTTTCTC TATTACTGGA CGTGGTACTG TTGCTTCAGG 
CGTGGTACTG TTAAGGTTAA TGATGAAGTT GAAATCGTTG 
CGATATTCAA AAAGCAGTTG TTACTGGTGT TGAAATGTTC 
TTGACGAAGG TCTTGCTGGT GATAACGTCG GTGTCCTTCT 
CAACGTGATG AAATTGAACG CGGTCAAGTT CTTGCTAAAC 



CCTCAAACGC 
CCTTATTGTT 
TTGAATTGGT 
CCAGGCGATG 
AGGTGATACT 
ATGATTACAT 
CCAGTCGAAG 
ACGTATTGAC 
GTATTCGTGA 
CGTAAACAGC 
TCGTGGTATC 
CAGGATCAAT 
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TCATCCACAT ACTAAATTCA 
AAGGTGGACG TCATACTCCA 
TTCCGTACAA CTGATGTAAC 
AATGGTTATG CCTGGTGATA 
5 CAATCGCTGT TGAACAAGGT 



AAGGTGAAGT TTATATTCTT 
TTCTTTAACA ACTACCGTCC 
TGGTTCAATT GATTTGCCAG 
ATGTTACGAT TGATGTTGAA 
ACAAC 



ACTAAAGAAG 650 
AC AGTT CT AC 70 0 

CAGGTACTGA 750 
CTGATCCACC 8 00 

825 



2) INFORMATION FOR SEQ ID NO: 223 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 ( D ) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



20 



lVi> ^; GIN ^S- Streptococcus oordonii (deposited as 
Streptococcus mitls) 
(B) STRAIN: ATCC 33399 



45 



SSSE SBSS ssss SJSSS ;ssk 
~ SS SSSSSS sss ssss 

GCCGTAGAAC AAGGTACTAC AT 



2) INFORMATION FOR SEQ ID NO: 224 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 827 bases 
c 0 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



60 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mutans 

(B) STRAIN: ATCC 25175 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224 
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CCCTGGTGCT ATCCTTGTAG TAGCTTCAAC TGATGGACCA ATGCCACAAA 50 

CTCGTGAACA CATTCTTCTT TCACGTCAAG TTGGTGTTAA ATACCTCATT 100 

GTCTTCATGA ATAAAGTTGA TTTGGTTGAC GATGAAGAAT TGCTTGAATT 150 

5 GGTTGAAATG GAAATCCGTG ATCTTCTTTC AGAATATGAT TTCCCAGGTG 200 

ATGATATTCC AGTTATTCAA GGTTCAGCTC TTAAAGCTCT TGAAGGCGAT 250 

ACTGCTCAAG AAGATATCAT CATGGAATTA ATGCATACTG TTGATGACTA 3 00 

CATTCCAGAT CCAGAACGTG ATACTGACAA GCCGCTCCTT CTTCCAGTCG 350 

AAGATGTTTT CTCAATCACT GGTCGTGGTA CTGTTGCTTC AGGACGTATT 4 00 

10 GATCGTGGTA CTGTTAAAGT TAACGATGAA GTTGAAATCG TTGGTATCCG 450 

TGATGACATT CAAAAAGCTG TTGTTACTGG TGTTGAAATG TTCCGTAAAC 500 

AATTGGATGA AGGTATTGCA GGGGATAATG TTGGTGTTCT CCTTCGTGGT 550 

ATCCAACGTG ATGAAATCGA ACGTGGTCAA GTTCTTGCTA AACCAGGTTC 600 

AATTCACCCA CATACTAAAT TCAAAGGTGA AGTTTATATC CTTACTAAAG 650 

15 AGGAAGGTGG ACGTCATACA CCATTCTTCA ATAACTATCG TCCACAATTC 70 0 

TACTTCCGTA CAACTGACGT AACTGGTTCA ATTGAGTTGC CAGCAGGTAC 750 

TGAAATGGTT ATGCCTGGTG ATAACGTTAC TATTGACGTT GAATTGATCC 800 

ATCCAATCGC TGTTGAACAA GGTACTA 827 

20 



2) INFORMATION FOR SEQ ID NO: 225 

(i) SEQUENCE CHARACTERISTICS : 
25 (A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: Genomic DNA 



35 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus parasanguinis 

(B) STRAIN: ATCC 15912 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225 



AGCTATCCTT GTAGTAGCTT CAACTGACGG ACCAATG CCA CAAACACGTG 50 

AACACATCCT TCTTTCACGT CAGGTTGGTG TTAAACACTT GATCGTCTTC 100 

4 0 ATGAACAAAG TTGACTTGGT TGATGATGAA GAATTGCTTG AATTGGTTGA 150 

AATGGAAATC CGTGACCTTC TTTCAGAATA CGATTTCCCA GGTGATGACC 2 00 

TTCCAGTTAT CCAAGGTTCA GCTCTTAAAG CTCTTGAAGG TGACTC TAAA 2 50 

TATGAAGATA TCATCATGGA ATTGATGGAT ACTGTTGATG AGTACATCCC 3 00 

AGAACCAGAA CGCGATACTG ACAAACCATT GCTTCTTCCA GTCGAAGACG 3 50 

45 TATTCTCAAT CACTGGACGT GGTACAGTTG CTTCAGGACG TATCGACCGT 4 00 

GGTGTTGTTC GTGTCAATGA TGAAATCGAA ATCGTTGGTA TCAAAGAAGA 4 50 

AATCCAAAAA GCAGTTGTTA CTGGTGTTGA AATGTTCCGT AAACAACTTG 5 00 

ACGAAGGTCT TGCAGGGGAT AACGTTGGTG TGCTTCTTCG TGGTATCCAA 550 

CGTGATGAAA TCGAACGTGG ACAAGTTATC GCTAAACCAG GTTCAATCAA 6 00 

50 CCCACACACT AAATTCAAAG GTGAAGTTTA CATCCTTACT AAAGAAGAAG 650 

GTGGACGTCA TACTCCATTC TTCAACAACT ACCGTCCACA GTTCTACTTC 7 00 

CGTACAACTG ACGTAACTGG ATCTATCGAA CTTCCACCAG GAACTGAAAT 7 50 

GGTAATGCCT GGTGATAACG TGACTATCGA CGTTGAGTTG ATCCACCCAA 8 00 

TCGCCGTTGA ACAAGGTACT ACAT go 4 

55 



60 



2) INFORMATION FOR SEQ ID NO: 22 6 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: raf) .. 
(A) ORGANISM: Streptococcus ratti 

10 ( B ) STRAIN: ATCC 19645 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226 
TGGTGCTATC CTOggg CTTC^CTG* ^ SSSSSE 
» SS SSSSgx S ^ggggC Tgjj™- 
TGAAATGGAA ATC CGTGAT C TTCTTTCAGA ATACGAli CT 
ACATTCCAGT TATCCAAGGT TCAGCCCTTA AAGCTCTTGA ACAT 
GAACAAGAAG ATGTTATCAT GGAATTGATG JAAA^I CCAGTCGAAG 
20 CCCAGATCCA GAACGCGATA CTGATAAGCC ATTGCTTCTT 

ACGTGTTCTC AATCACTGGA CGTGGTAC1G U^J GTATCCGTGA 
CGTGGTACTG TTAAAGTCAA TGACGAAGTT GAAAjLCG CGTAAACAGC 
TGACATCCAA AAAGCTGTTG TTACTGGTGT TGAAATGll TCGTGGTATC 
TTGACGAAGG TCTTGCTGGT gTAACGTTG GTGTACTT^ CAGGTTCAAT 
25 CAACGTGATG AAATCGAACG C ^TCAAGTT Ui* ACTAAAGAAG 
TCATCCGCAT ACTAAATTTA AAGGTGAAGT ™CATCCTT acagttctac 
AAGGCGGACG TCACACTCCA TTCTTUAAU^ qaattgCCAG CAGGTACTGA 
StSS CGACGTTGAA TTGATCC^CC 

3 0 CAAT CGCTGT TGAACAAGGT ACTA 



35 
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2) INFORMATION FOR SEQ ID NO: 2 27 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7 95 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 



45 



. ■■■■■ 

60 t?^SgSI cSSiAGGT GTGCTTCTCC GTGGTATCCA ACGTGATGAA 
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(vi) ORIGINAL SOURCE: . 

(A) ORGANISM: Streptococcus sanguinis 

(B) STRAIN: ATCC 10556 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227 

50 TGTAGTAGCT TCAACT^ T^TrHTCTT IATGAACAAA loS 
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ATCGAACGTG GACAAGTTAT CGCTAAACCA 
TAAATTCAAG GGTGAAGTTT ATATCCTTAC 
ACACTCCATT CTTCAACAAC TACCGTCCAC 
GACGTTACAG GTTCAATCGA ACTTCCAGCA 
5 TGGTGATAAC GTAACAATCG ACGTTGAGTT 



GGTTCAATCA ACCCACACAC 600 

TAAAGAAGAA GGCGGACGTC 650 

AGTTCTACTT CCGTACAACT 700 

GGTACTGAAA TGGTAATGCC 750 

GATCCACCCA ATCGC 795 



10 



2) INFORMATION FOR SEQ ID NO: 228 



( i ) SEQUENCE CHARACTER I STICS : 

(A) LENGTH: 795 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

2 0 (A) ORGANISM: Streptococcus sobrinus 

(B) STRAIN: ATCC 33478 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228 

25 TGTAGTAGCT TCTACTGACG GACCAATGCC ACAAACTCGT GAACACATCT 50 

TGCTTTCACG CCAAGTTGGT GTTAAGAAC C TCATCGTCTT CATGAACAAG 10 0 

GTTGACTTGG TTGATGATGA AGAATTGCTT GAATTGGTTG AAATGGAAAT 15 0 

CCGTGATCTT CTTTCAGAAT ACGATTTCCC AGGTGACGAC ATTCCTGTTG 2 00 

TTCAAGGTTC AGCTCTTAAG GCTCTTGAAG GTGATACAGC TGCCGAAGAC 25 0 

3 0 AAGATTATGG AATTGATGGA CATCGTTGAT GATTACATTC CAGAACCAAA 3 00 

ACGCGATACT GATAAGCCAT TGCTTCTCCC AGTCGAAGAC GTATTCTCAA 350 

TCACTGGTCG TGGTACTGTT GCTTCAGGAC GTATTGACCG TGGTACTGTT 4 00 

AAGGTTAACG ACGAAGTTGA AATCGTTGGT ATCCGTGACG ATATCCAAAA 450 

AGCAGTTGTT ACTGGAGTTG AAATGTTCCG TAAGCAATTG GACGAAGGTC 500 

35 TTGCTGGAGA TAACGTTGGT GTGCTTCTTC GTGGTATCCA ACGTGATGAA 550 

ATTGAACGTG GTCAAGTATT GGCTGCACCT GGTTCAATCC ACCCACACAC 60 0 

TAAGTTCAAG GGTGAAGTTT ACATCCTTTC TAAAGATGAA GGTGGACGTC 65 0 

ACACTCCATT CTTCAACAAC TACCGTCCAC AGTTCTACTT CCGTACAACT 700 

GACGTAACTG GTTCAATCGA ATTGCCAGCA GGTACTGAAA TGGTTATGCC 750 

4 0 TGGTGATAAC GTTACTATCG ACGTTGAATT GATCCACCCA ATCGC 795 



45 



2 ) INFORMATION FOR SEQ ID NO: 22 9 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 797 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Streptococcus suis 

(B) STRAIN: ATCC 437 65 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229 

60 TGTAGTAGCT TCAACTGACG GTCCAATGCC ACAAACTCGT GAGCACATCC 50 
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TTCTTTCACG TCAGGTTGGT ™£CC ™TCGTCTT ™^ 
GTTGACTTGG TTGACGATGA AGAATTGCTT CTTCCAGTTA 
CCGTGACCTT CTTTCAGAAT ACGATTTCCC AGGTGA GTACGAAGAC 
TCCAAGGTTC AGCTCTTAAA GJTCTTGAAG gTGAU caqaaccAGA 

5 atcgttatgg aattgatgaa ^tgttgat gagta aggac gtattctcaa 

ACGCGACACT GACAAACCAT TGTOLi i^u CCG TGGTACTGTT 

TCACTGGTCG TGGTACTGTA. GJTTCAGGAC JTATCGACC AAAAATCTAA 
CGTGTCAACG ACGAAATCGA AATCGTTGGT CTTCAAGAA^ ^3^^ 
AGCAGTTGTT ACTGGTGTTG JAATGTTCCG TAAACAAC^ 
10 TTGCCGGCGA TAACGTTGGT gTGCaiUii GGTTCTATCA ACCCACACAC 

ATCG ^?IS SSSJSc TAAAGAAGAA GGTGGACGTC 

TAAATTCAAA GGTGAAGTTT WAJ^v AGTTCTACTT CCGTACAACT 
ACACTCCATT CTTCGACAAC ^TCCAC Jgg^^ TGGTAATGCC 

15 ?5S?GA^AAC ACGTTGAATT GATCCACCCA ATCGCCG 
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2) INFORMATION FOR SEQ ID NO: 230 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 793 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 ( D ) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Streptococcus uteris 

( B ) STRAIN: ATCC 19436 



slllsillttil 

SSKSSS aaatcgttgg tatcaaagaa gaaactaaaa 
TSSAttgt tactggtgtt gaaatgttcc gtaaacaact tgacgaaggt 

AAGCAGTTGT TACT^i^xi ^ CGTGGTGTTC AACGTGACGA 

S??SSS? GGACAAGTTA SgcSaACC AGGTTCAATC AACCCACACA 
CTAAATTCAA AgS5gAAGTT TACATCCTTT CTAAAGATGA AGGTGGACGT 
rATACTCCAT TCTTCAACAA CTACCGTCCT CAATTCTATT TCCGTACAAC 
SSacgtaaca GGTTCAATCG aacttccagc TGGTACTGAA ATGGTAATGC 
c?gS?5Saa cgtgacaatc agcgttgagt tgatccaccc aat 



2) INFORMATION FOR SEQ ID NO: 231 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Streptococcus vestibularis 

(B) STRAIN: ATCC 49124 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231 

10 TTGTAGTAGC ATCTACTGAC GGACCAATGC CACAAACTCG TGAGCACATC 50 

CTTCTTTCAC GTCAGGTTGG TGTTAAACAC CTTATCGTCT TCATGAACAA 100 

AGTTGACTTG GTTGACGATG AAGAATTGCT TGAATTGGTT GAAATGGAAA 150 

TCCGTGACCT TCTTTCAGAA TACGATTTCC CAGGTGATGA TATTCCAGTT 2 00 

ATCCAAGGTT CAGCTCTTAA AGCTCTTGAA GGTGATTCTA AATACGAAGA 250 

15 CATCATCATG GACTTGATGA ACACTGTTGA CGAATACATT CCAGAACCAG 3 00 

AACGTGACAC TGACAAACCA TTGTTGCTTC CAGTCGAAGA CGTATTCTCA 3 50 

ATCACTGGTC GTGGTACTGT TGCTTCAGGA CGTATCGACC GTGGTGTTGT 4 00 

TCGTGTTAAT GACGAAGTTG AAATCGTTGG TCTTAAAGAA GAAATCCAAA 4 50 

AAGCAGTTGT TACTGGTGTA GAAATGTTCC GTAAACAACT TGACGAAGGT 500 

2 0 ATTGCCGGAG ATAACGTCGG TGTCCTTCTT CGTGGTATCC AACGTGATGA 550 

AATTGAACGT GGTCAAGTAT TGGCTGCACC TGGTTCAATC AACCCACACA 60 0 

CTAAATTCAA AGGTGAAGTT TACATCCTTT CTAAAGAAGA AGGTGGACGT 650 

CACACTCCAT TCTTCAACAA CTACCGTCCA CAGTTCTACT TCCGTACAAC 7 00 

TGACGTAACA GGTTCAATCG AACTTCCTGC AGGTACTGAA ATGGTTATGC 750 

2 5 CTGGTGATAA CGTGACTATC GACGTTGAGT TGATCCACCC AATCGCCG 798 



30 



2) INFORMATION FOR SEQ ID NO: 232 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE. TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
4 0 (A) ORGANISM: Tatumella ptyseos 

(B) STRAIN: ATCC 33301 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232 

4 5 GGCGCTATCC TGGTTGTTGC TGCAACTGAC GGCCCTATGC CTCAGACCCG 50 

TGAGCACATC CTGCTGGGCC GCCAGGTAGG CGTTCCTTAC ATCATCGTGT 100 

TCCTGAACAA ATGTGACATG GTTGATGATG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TCCGTGACCT GCTGTCACAG TACGACTTCC CGGGTGACGA 2 00 

CACGCCAATC GTTCGCGGTT CAGCGCTGAA AGCACTGGAA GGTGAAGGCG 250 

50 AGTGGGAAGA GAAGATTCTG GAGCTGGCTG GCTTCCTGGA TTCTTACATC 3 00 

CCTGAGCCAG AGCGTGCTAT CGATCAGCCG TTCCTGCTGC CAATCGAAGA 350 

CGTATTCTCA ATCTCCGGTC GTGGTACAGT TGTTACCGGT CGTGTAGAGC 4 00 

GCGGGATCAT CAAAGTCGGT GAAGAAGTTG AGATCGTTGG TATCAAAGAT 4 50 

ACTGCGAAAT CAAC CTGTAC CGGTGTTGAA ATGTTCCGTA AACTGCTGGA 5 00 

55 CCAGGGTCAG GCGGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAGC 550 

GTGAAGAGAT CGAACGTGGT CAGGTTCTGG CTAAACCAGG TTCAATCAAA 600 

CCACACACCC AGTTCGAGTC AGAAGTTTAT ATTCTGTCTA AAGACGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

GTACAACTGA CGTGACCGGA ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

60 GTAATGCCTG GTGACAACAT CAAAATGGTT GTTACCCTGA TCCATCCAAT 80 0 

270 
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CGCGATGGAC GATGGTCTGC GTTTCGCAA 

2) INFORMATION FOR SEQ ID NO: 233 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



829 



(ii) MOLECULE TYPE: Genomic DNA 
n c /vi ) ORIGINAL SOURCE : 

15 (a) ORGANISM : Trabulsiella guamensxs 

(B) STRAIN: ATCC 49490 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233 

20 GGCGCAATCC TGGTAGTAGC AGC^ACTGAC GGCCCGATGC ~CTCO 
TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTU qgaActggTA 
TCCTGAACAA ATGCGACATG GTTGATGACG ££££ CGGGCGATGA 
GAGATGGAAG TTCGTGAACT ^Tggg TACGAX qGCGACGCAG 

25 CACGCCGATC GTACGTGGTT CTGCTCTGAA £^^;f™ TTCTTACATT 
AGTGGGAAGC GAAAATCATC GAACTGGCAG GTTTCCTGGA TTCTCACA^ 
CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTOC 
CGTATTCTCC AT CTCTGGTC GTGGTACCGT ^ACCGGi ^ 
GCGGTATCAT CAAA^GGT «*»Jg™ JAATCGXT AACTGCTGGA 
30 ACTGCGAAGT CAACCTGTAC TGGCGTAGAA QGTATCAAAC 
CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT ^TOCl^ ^^tcaaC 
GTGAAGAAAT CGAACGTGGT CA^ACTGG CTAAGCCG 

CCGCA ^^ *?SSS£ct tSaSggSa CCGTCCGCAG TTCTACTTCC 
CGGCCGTCAC ACTCCGTTCT JCAAA^lia CGTAGAGATG 

35 SSSSSS gSa™ caaaatgSt? gttaccctga tccacccgat 

CGCGATGGAC GACGGTCTGC GTTTCGCAA 



40 



45 



50 



55 



60 



2 ) INFORMATION FOR SEQ ID NO: 234 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Veillonella parvula 

(B) STRAIN: ATCC 10790 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234 

PGGCGCTATC TTGGTTGTAT CCGCAGCTGA CGGCC CTATG CCTCAAACTC 
^CGCTATC i,i,^i,^i CT cgcCAAGTTG GTGTTCCTGC AAT CGTAGTA 

ttSt"S££ Sgctga?at ggttgIcgat gaagaattga tcgaattggt 
ISaaItSSaa gttcgtgaac ttctttcttc ctacgaattc cctggcgacg 

271 
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AAGTACCTAT CGTTGTAGGT TCCGCGTTGA AAGCTTTGGA AGGCGATGCT 250 

CAATATGTAG CTAAAATTGA CGAATTGATG GACGCTGTAG ACTCCTACAT 3 00 

CCCAACACCA GTTCGTGACA CTGATAAACC ATTCTTGATG CCTGTGGAAG 350 

ATGTTTTCAC AATCACTGGT CGTGGTACAG TAGCAACTGG CCGTGTTGAA 4 00 

5 CGTGGTCAAG TAAACGTTGG TGATACTGTT GAAGTAGTAG GCTTGAAAGA 4 50 

AAAAGCTGAA CAATACGTAG TAACAGGTCT TGAAATGTTC CGTAAAGTGT 500 

TGGATTCTGC AGTAGCAGGT GACAACGTAG GTGCATTGCT TCGTGGTGTT 550 

GATCGTAAAG ACATYGAACG TGGTCAAGTA TTGGCTAAAC CAGGTTCCAT 600 

CAACCCACAY ACAAAATT C A AAGCAGAAGT ATACGTATTG ACTAAAGAAG 650 

10 AAGGTGGTCG TCATACTCCA TTCTTCTCCA ACTACCGTCC ACAATTCTAC 700 

TTCCGTACAA CAGACGTAAC AGGTGTTGTA AACCTTCCTG AAGGTGTAGA 750 

AATGTGTATG CCTGGCGATA ACGTAACAAT GGAAATCGAA TTGATTACTC 8 00 

CAATCGCTAT CGAAGAAGGT CTTCG 325 



15 
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2) INFORMATION FOR SEQ ID NO: 235 



(i) SEQUENCE CHARACTERISTICS : 
2 0 (A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia enterocolitica 

(B) STRAIN: ATCC 9610 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235 



CGCTATCCTG GTTGTTGCTG CAACTGATGG CCCAATGCCA CAGACTCGTG 50 

AGCACATCCT GTTGGGTCGT CAGGTTGGTG TTCCTTACAT CATCGTATTC 100 

3 5 ATGAACAAAT GTGACATGGT TGACGATGAA GAGCTGCTAG AACTGGTAGA 150 

AATGGAAGTG CGCGATCTTC TGTCTACCTA CGATTTCCCA GGCGATGATA 2 00 

CGCCAGTTGT TCGTGGTTCC GCGCTGAAAG CATTGGAAGG CGAACCTGAG 2 50 

TGGGAAGCAA AAATTATCGA ACTGGCTGGC TACCTGGATT CTTACATCCC 3 00 

AGAACCAGAG CGTGCTATCG ATAAGCCGTT CCTGCTGCCA ATCGAAGACG 3 50 

4 0 TATTCTCTAT CTCTGGTCGT GGTACTGTTG TAACGGGTCG TGTAGAGCGC 4 00 

GGTATCGTTA AAGTTGGTGA AGAAGT CGAA ATTGTTGGCC TGAAAGATAC 4 50 

CGTTAAATCT ACTTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGATG 500 

AAGGC CGTGC AGGTGAGAAC GTTGGTGTTC TGCTGCGTGG TATCAAGCGT 550 

GAAGATATCG AACGTGGTCA AGTTCTTGCT AAACCAGGTT CGATTAAACC 600 

4 5 ACACACCAAA TTTGAATCAG AAGTTTATAT TCTGAGCAAA GATGAAGGTG 650 

GTCGCCATAC TCCGTTCTTC AAAGGCTACC GTCCTCAGTT CTACTTCCGT 700 

ACAACTGATG TAACCGGTAC TATTGAACTG CCAGAAGGCG TTGAGATGGT 750 

GATGCCAGGT GATAACATTC AAATGATTGT TAACCTGATT GCTCCTATCG 800 

CAATGGATGA CGGCTTGCGC TTTGC o^c 

50 a ^ b 

2) INFORMATION FOR SEQ ID NO: 236 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 
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(ii) MOLECULE TYPE: Genomic DN A 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: yersinia fredenJcsenii 

5 (B) STRAIN: ATCC 33641 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236 

SSSE SS ?SS c c SSE5S if. 

US ill ? SSSSS ssssa = 

- 1 ill &s sssss ssss = 

SSStCCT TAAAGTCGGC GAAGAAGTCG AAATCGTTGG TATCATTGAT 450 

- nan i 

llii sssss sssss sssss sssss 

25 CGCAATGGAT GACGGTCTGC GCTTTGCG 



2) INFORMATION FOR SEQ ID NO: 237 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (d) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (a) ORGANISM: Yersinia intermedia 

(B) STRAIN: ATCC 29909 



CTTGGTTGTC GCTGCAACTG ATGGTCCTAT GCCACAGACT CGCGAGCACA 

tcctoSagg ?cgtcaggtg GGTGTTCCTT ACATCCTGGT cttcctgaac 

AACTGTGACA TGGTTGACGA TGAGGAGTTG CTGGAATTGG TAGAAATGGA 
^^^7? p^ptGTCTC AATATGATTT CCCTGGCGAT GATACTCCTG 

iSSSSS SSSSSS 5JSotigg aaggcgagcc tgaatgggaa 
SvISStta tcgaattagc tgaggcgctg gatagttata ttccacagcc 
SSSS55S attgatagac Sttcttgct gccaatcgaa gacgtattct 
c?a?c?cagg tcgtggtaca gtcgtcactg gtcgtgtaga gcgtgggatc 

OTTAAAOTTG GCGAAGAAGT TGAAATCGTT GGTATTATCG ATTCCATTAG 
AACAACATGT ACTGGCGTTG AAATGTTCCG CAAATTGCTG GACGAAGGCC 
PTPCGGGTGA GAACGTTGGT GTTCTACTGC GTGGGACTAA ACGTGATGAC 
g?acag?gS GTCAGGTATT AGCTAAGCCA GGTTCTATCA AGCCACATAC 
TAAATTCGAA TCCGAAGTTT ATATTCTGAG CAAAGATGAA GGCGGGCGTC 
ACACGCCGTT CTTCAAAGGC TACCGTCCTC AGTTCTACTT CCGTACAACG 

gSgtSccg gtactattga attgccagac ggcgttgaga tggtgatgcc 
ag5?SJSc attcaaatga ttgttaacct gattgcacct attgcgatgg 



CA 02283458 1999-09-28 



ATGATGGTCT GCG 
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2) INFORMATION FOR SEQ ID NO : 238 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia pestis 

(B) STRAIN: KIM D27 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238 

GGAGCGATCT TGGTTGTTGC TGCAACCGAT GGCCCTATGC 
TGAGCATATC CTGCTGGGCC GCCAGGTTGG TGTCCCATAC 
TCCTGAACAA ATGTGACATG GTTGACGATG AAGAGTTGCT 
GAAATGGAGG TTCGTGAGCT TCTGTCTCAA TACGATTTCC 
CACTCCAGTC ATCCGTGGTT CAGCGTTGAA AGCCCTGGAA 
AGTGGGAAGC TAAAATTATC GAGTTGGCAG AAGCTCTGGA 
CCGCAACCAG AACGCGCTAT TGATAGAC C A TTCCTATTGC 
CGTATTCTCT ATTTCTGGTC GTGGTACTGT AGTTACTGGT 
GTGGTATTGT TAAGGTCGGC GAAGAAGTTG AAATCGTTGG 
ACGATTAAAA CAACTTGTAC TGGCGTTGAA ATGTTCCGCA 
TGAAGGCCGT GCTGGTGAAA ATGTTGGTGT TCTGCTGCGT 
GTGACGATGT TCAGCGTGGT CAAGTACTGG CGAAACCAGG 
CCACACACGA AGTTTGAGTC AGAAGTTTAT ATTCTGAGCA 
CGGC CGT CAT ACACCGTTCT TCAAGGGCTA CCGTCCTCAG 
GTACAAC TGA CGTGACCGGT ACCATTGAGC TGCCAGAAGG 
GTCATGCCTG GTGACAACGT AAACATGGTT GTTAAC CTAA 
CGCAATGGAT GATGGTCTGC GCTTCGCAA 



CGCAGACTCG 50 

ATTATTGTCT 100 

AGAGTTGGTT 15 0 

CAGGCGACGA 2 00 

GGTGACGCTG 250 

TAGCTATATT 3 00 

CAATTGAAGA 3 50 

CGTGTAGAAC 400 

TATTATCGAT 45 0 

AGCTGCTGGA 5 0 0 

GGTACTAAGC 55 0 

TTCTATCAAG 600 

AAGATGAAGG 650 

TTCTACTTCC 700 

CGTTGAAATG 750 

TTGCTCCTAT 8 00 
829 
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2) INFORMATION FOR SEQ ID NO: 23 9 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia pseudotuberculosis 

(B) STRAIN: ATCC 29833 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239 

TGGAGCGATC TTGGTTGTTG CTGCAACCGA TGGCCCTATG CCGCAGACTC 

GTGAGCATAT CCTGCTGGGC CGCCAGGTTG GTGTC CCATA CATTATTGTC 

TTCCTGAACA AATGTGACAT GGTTGACGAT GAAGAGTTGC TAGAGTTGGT 

TGAAATGGAG GTTCGTGAGC TTCTGTCTCA ATACGATTTC CCAGGCGACG 

274 
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ACACTCCAGT CATCCGTGGT JCAGCGTTGA AJGCCCTGGA AGGTGACGCT 
GAGTGGGAAG CTAAAATTAT JGAGTTGGCA ^ CCAATTGAAG 
TCCGCAACCA GAACGCGCTA "GATAGACC ATTCCTAl^ tcgtgtagaa 
ACGTATTCTC TATTTCTGGT CGTGGTACTG ^ GTATTATCGA 
5 CGCGGTATTG TTAAGGTCGG GGAAGAAGTT ggj^ AAGCTGCTGG 
TACGATTAAA ACAACTTGTA CTGGCGTTGA TGGTACTAAG 
ATGAAGGCCG TGCTGGTGAA AATGTTGGTG ™^AeCAG gtTCTATCAA 
CGTGACGATG TTCAGCGTGG TCAAGTACTG £ c aaagatGAAG 

GCCACACACG AAGTTTGAGT CAGAAOTTA TAU GTTCTACTTC 
10 GCGGCCGTCA JACACCGTTC TTCAAGGGCT ^CAGAAG GCGTTGAAAT 

SSS Stggt tgttaaccta attgctccta 

TCGCAATGGA TGATGGT 



15 



20 



2) INFORMATION FOR SEQ ID NO: 240 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: , , . 

(A) ORGANISM: Yersinia rohdei 

(B) STRAIN: ATCC 43380 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240 

TGGCGCGATC CTGGTTGTTG CTGCAACTGA TGGCCCgTG ™££ 
GCGAGCACAT CCTGTTGGGT ^GTCAAGTGG ^TGTTCCT XGG AACT GGT 
TTCCTGAACA AGTGTGACAT ^GTTGACGAC JAAGA^ CCTGGCGATG 
TGAAATGGAA GTTCGTGAGC TTCTGTCTCA ™ AGGCGAGGC C 

ACACTCCGGT TATTCGTGGT TCCGCGCTGA ^ Q aTAGCTACAT 

GAGTGGGAAG CCAAAATTAT JGAACTTGC1 £ C CAATCGAAG 

TCCACAGCCA GAGCGCGCGA ^TGATAAACC A^iux GCGTGTTGAG 
40 ACGTATTCTC TATCTCAGGC CGTGGAACAG JTGTTACCGG ^ 
CGCGGTATCG TCAGAGTGGG CGAAGAAG^T J^<£ AAACTGCTGG 
CAC C ATT AAA ACCACTTGTA CCGGTGTTGA ^™ri CGGTACTAAA 
ACGAAGGCCG TGCCGGTGAG ^JGTTTCTG JTCT^x GTTCTATTAA 
CGCGATGACG TGCAACGTGG TCAAGTGTTG ^CTAAA^ aAAGATGAAG 
45 GCCGCATACC ^TTGAGT JAGAAGTTTA JATTCTGA^ QTTCTACTTC 
GTGGTCGTCA TACTCCGTTC JTCAAAGGii QTGTTGAGAT 
SSaS SSSSSS TGTTAACCTG ATTGCGCCTA 
??SSJ?SS iGACGGTCTA CGATTTGCA 
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2) INFORMATION FOR SEQ ID NO: 241 

„ (i) SEQUENCE CHARACTERISTICS: 

M ( A ) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



250 

300 

350 

400 

450 

500 

550 

600 

650 

700 

750 

800 

817 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yokenella regensburgei 
5 (B) STRAIN: ATCC 35313 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241 

TGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

10 GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCGGGCGACG 200 

ACACTCCGAT CATCCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGAAGCA 2 50 

GAGTGGGAAG CTAAAATCGT TGAGCTGGCT GGCTTCCTGG ATTCTTACAT 3 00 

15 CCCAGAACCA GTTCGTGCTA TCGACCTGCC GTTCCTGCTG CCGATCGAAG 3 50 

ACGTATTCTC CATCTCCGGT CGTGGCACCG TTGTTACCGG TCGTGTAGAG 400 

CGCGGTATCG TTAAAGTTGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTGCTAAG TCTACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 5 00 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 5 50 

20 CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GTTCCATCAA 600 

GCCGCACACC AAATTCGAAT CTGAAGTTTA TATCCTGTCC AAAGACGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACAACTG ACGTGACTGG TACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

25 TCGC 8Q4 



30 



2) INFORMATION FOR SEQ ID NO: 242 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 849 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 < A > ORGANISM: Achromobacter xylosoxidans subsp . 

deni tri fi cans 
(B) STRAIN: ATCC 15173 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242 

TCAGTTCCCC CGCGATCACA TGCCCAAGAT CTACGAAGCG CTTACTCTGG 
CCGACGAGGG TTCCTCGTTC GCCGAAAAGG GTCTGACGCT GGAAGTGCAG 
CAACAGCTGG GCGACGGCGT GGTGCGTACC ATCGCGCTGG GCTCCAGCGA 
CGGCCTGCGC CGCGGTATGA AGGTCACCGG TACGGGCGCG CCGATCTCGG 
50 TGCCGGTCGG CACCGGCACG CTGGGCCGCA TCATGGACGT GCTGGGTCGT 
CCCATCGACG AAGCCGGCCC GATCCAGCAC GAAGAAAAGC GTGGCATTCA 
CCAGCCGGCT CCCCGTTTCG ACGAACTGTC GCCGTCGGTG GAACTGCTGG 
AAACCGGCAT CAAGGTTATT GACCTGGTCT GCCCGTTCGC CAAGGGCGGC 
AAGGTCGGCC TGTTCGGCGG CGCCGGCGTG GGCAAGACCG TCAACATGAT 4qn 
55 GGAAC TGAT C AACAACATCG CCAAGCAGCA CAGCGGCTTG TCGGTGTTCG 50 0 

CCGGCGTGGG CGAGCGTACC CGCGAAGGCA ACGACTTCTA CCACGAAATG iSn 
GAAGAGTCGA ACGTTCTGGA CAAGGTTGCG ATGGTGTTCG GTCAGATGAA 
CGAACCCCCG GGCAACCGTC TGCGCGTGGC GCTGACCGGC CTGACCATGG 
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600 

™ GAA ^ T CCGCGACGAA GGCCGCGACA TCCTGTTCTT StgoS llo 
SO ATCTACCGCT ACACCCTGGC CGGTACGGAA GTGTCCGCGC TGCTGGGCCG 75^ 
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2SSS SHE SSSSS SSSS SSK? 

2) INFORMATION FOR SEQ ID NO: 243 



( i ) SEQUENCE CHARACTERI STICS : 
(A) LENGTH: 787 bases 
(■a) TYPE: Nucleic acid 
10 (c STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(Vi)( ^ G IN S™! : A cinetobacter bau.annii 

( B ) STRAIN: ATCC 19606 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243 
TGAAACTACT TTAGAAGTTC TGGTGATGGT ? TTGTTCGTA 

CCATCGCAAT GGGTTCTACA GAAG^TCTTA £££ GCCA C TCTTGGCCG 
aSScAAACG CACCGATCTC TGTTCCAGTT ^JCAG^ CCTGTTG cga 

tatcatggac gttttaggtc gt ^ctaj.uua tctta xgctgaacaa 
25 SSaagaacg tttgccgatt caccgtcaag cgccttctta cttact 
gcagcttcta ctgacctttt agaaactggt attaaag < ^ gcsma 
ttgcccgttt gcgaaaggtg gtaa^gttgg ttta caacat CGCGAAAG ca 

TTGGTAAAAC CGTTAACATG ATGGAATTbrA CGTA CT CGTGAAGG 

CACTCAGGTT TATCTGTGTT TGCTGGTGTT GGiU* GACAAAGTAG 
TAATGACTTC TATCACGAAA TGAAAGATTC Q -pTTACGCGTA 

CAATGGTCTA CGGTCAGATG AACGAGCCAC aaaaaqaCGA 
GCGTTAACTG GTTTGACTAT GGCTGAATAC JTC^^ ATCT ACCGTT 
AAACGGTAAA GGTCGTGACG JATTATTAii CG TATGCCATCT 

35 ATACACTTGC JGGTACTGA^ JTATCAGCAT gaaatgggtg ^TTCAAGA 750 

GCGTATTACA £££££ cSgGTTCGAT CACTTCG 

40 2) INFORMATION FOR SEQ ID NO: 244 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 
45 (c) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(B) STRAIN: CDCF 3697 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244 
GTTCCTAAGA TCTATGACGC TCTCCACGTT GATGGCACTG AAACTACATT 

agaagttcag caacaacttg gtgatggcgt agttcgta^ cactaacgcg 
ZSESSSi SSSSSS SSZSK ctaggtcgta tcatggacgt 
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^TTGGTCGC CCAATCGACG AAGCTGGTCC AGTTGCGACT GAAGCGCGTT 250 

TGCCGATTCA CCGTCAAGCA CCTTCTTATG CTGAACAAGC AGCTTCTACT loo 

GACCTTTTAG AAACTGGTAT TAAAGTCATC GACTTACTTT GCCCGTTCGC 350 

TAAAGGTGGT AAAGTTGGTC TGTTCGGTGG TGCCGGTGTT GGTAAAACTG 4oS 

5 TAAACATGAT GGAGTTGATC AACAACATCG CTAAAGCGCA CTCAGGTTTA tin 

TCTGTATTCG CTGGTGTTGG TGAGCGTACT CGTGAAGGTA ATGACTTCTA 500 

AAAGACTCAA ACGTTCTAGA CAAAGTAGCA ATGGTCTACG t 50 

GTCAGATGAA CGAGCCACCG GGTAACCGTT TACGCGTAGC GTTGACTGGT Ion 

TTGACCATGG CTGAGTACTT CCGTGACGAG AAAGACGAAA ACGGCAAaIg 650 

10 CCGTGACGTA CTATTGTTCG TAGATAACAT CTACCGTTAT ACACTAGCAG 7nS 

GTACTGAAGT ATCAGCACTT CTAGGTCGTA TGCCGTCTGC AGTAGG??a? 7^ 

CAACCGACAC TTGCAGAAGA GATGGGTGTT CTTCAAGAAC GTATTaStc Inn 

GACTAAGTCT GGTTCGATTA CGTCA TACATC 800 

825 

15 
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2) INFORMATION FOR SEQ ID NO: 245 

( i ) SEQUENCE CHARACTERI STI CS : 
20 (A) LENGTH: 83 7 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: CSG-197 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245 

SS JgSSSSE £££££ *• 

35 SESSSSS S22J2™ ™£?S ££S III 

200 

TGCATTCGAT CAATTATCAA CAAAAGTTGA AATCTTAGAA j£SSSS£ III 
40 AAGTAATTGA TTTACTTGCA CCATATATTA AAGGTGGTAA Inn 

450 
500 
550 
600 
650 
700 



TACAGAAGTT CGAGATAGCG GAGATAGCAT CAGTGTTCCA GTTGGTGATG 
CTACGTTAGG ACGTGTGTTT AATGTTCTTG GTGATACAAT tSctESJ? 

£*S™™F ™ CGTGAT CoiSSS gaSSgc^c? 



TTCGGTGGCG CTGGTGTAGG TAAAACAGTA TTAATTCAAG AATTAATTAA 
TAATATAGCT CAAGAACATG GTGGTATTTC AGTATTTgSc oSSJJStc 
AACGTACGCG TGAAGGTAAT GACTTATACT ACGAAATGAG TGAtSSSS 
AC GTTATTAAGA AAACAGCTAT GGTCTTCGGA CAAATGAATG McSeCTOT 
45 TGCGCGTATG CGTGTTGCTT TATCAGGCTT AACAATGGCT GAACACTTCr 
GTGATGTACA AGGACAAGAT GTTTTACTAT TTATOGATAA ^ATATTPACA 
TTTACGCAAG CTGGTTCAGA AGTATCAGCA CTATTAGGTc" OTATgS5?£ 
AGCCGTTGGT TATCAACCTA CCCTTGCTAC TGAAATGGGT CAA^SSS Inn 
AACGTATTAC ATCAACAACT AAAGGATCTG TAACGTC CAATTACAAG 800 
50 o o 7 



2) INFORMATION FOR SEQ ID NO: 246 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 851 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



278 
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(ii) MOLECULE TYPE: Genomic DNA 

^^JS"- Alcaldes faecaiia 
5 (B) STRAIN: ATCC 15554 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246 
TTC CCCCGCG ACAGCATCCC TAAAGTCTAC GAAGCATTGA CGCTCGTTGA 
10 CGAAAGTTCG GCTTTCGCAG AAAAAGGCCT GACTTTU^ caqcGACGGC 
AATTGGGTGA CGGTGTAGTT OGCWXMCG CCATGGG^ TCTCCGTTCC 
CTGCGCCGCG GTATGGAAGT ^GGTTCG ^ TQ GGTCGC CCTA 

CGTGGGTGTC GGCACCCTGG ^CGCATTAT ^CG CATTCACCAG 
TTGACGAAGT CGGTCCTATT CAGTCCGACG AGO c TGCXGGAAAC 

15 CCTGCGCCTA CTTTCGACGA ACTGTCGCC1 GGTGGTAAGG 
CGGTATTAAA GTGATTGACC TGGTTTGCCC Gl^ CATGCTGGAG 
TTGGTCTGTT CGGTGGTGCC g^GTGGGCA AGA TGTTTGCCGG 
CTGATCAACA ACATCGCCAA GGCACACAGL £ ^ qaaaTGGCCG 

Stgggtgag cgtacccgtg aaggtaacga cttctaccau agtggccatg 
20 atgctggcgt tatccagatg gacaacctga Jcgag gtgtggcact 
gtgttcggtc agatgaacga acctccaggc ^ cgtgacatcc 
gtccggcctg accatggccg agaagttccg tga gq tacagaagtg 
tgttctttgt ggacaacatc taccgciaua acc agcctacgct 

25 GGCCGAGGAA ATGGGTAAGC TGCAAGAGCG CATTACCTCC ACCAAGACCG ... 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 



30 2) INFORMATION FOR SEQ ID NO: 247 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 
(c) STRANDEDNESS : Double 

3b (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: ^rhracis 

4U v (A) ORGANISM: Bacillus anthracis 

(B) STRAIN: 4229 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247 
" OATGGCGOAA AGCTACCAGA TTAACTTAAC SS ZSSSSS 

ssssse ^sss| ssssgj SS3SS5 

gttcgtggca cagaagtaga agatactggt gatgcaattg 

50 TGGTGATGCA ACACTTGGTC gJGT^TTTAA U AATTCACCGT 
ACTTAGATGG TGAGGTTCCT JCGGATGTAO ^ ^ TTCTTGAAAC 
CAAGCACCTG CATTCGAAGA ATTATCTACT aa* GGTGGTAAGA 
TGGTATTAAA GTAGTAGACT JACTTGCTCC TTACA1 1 ^tcaggaA 
TCGGTCTATT CGGTGGTGCC GGTGTAGGTA AAACGG TATTCGCTGG 

55 TTAATCAATA ACAT CGCACA AGAACACGGT ^EJL.i^ QAAATGAGCG 
TGTAGGTGAG CGTACTCGTG AGGGTAATGA ^ATACCAC 
ATTCTGGCGT AATTAAGAAA ACTGCGATGG ™^ CAATGGCTGA 
CCACCTGGAG CACGTCAACG TGTTGCGTTA aua ATCGATAATA 

60 tcttcStS SSSSSS SSctgaag tSSgccct TCTTGGCCGT 



50 
100 
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200 
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ATGCCATCTG CGGTAGGTTA CCAACCAACA CTTGCAACAG AAATCfY-rra 
ATTACAAGAG CGTATTACAT CTACAAATAA SgS^ATC aIStC^ U°° 



2 ) INFORMATION FOR SEQ ID NO: 248 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 810 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 14579 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248 

^^ATCT ACAATGCCCT TACGGTAAAA CAAAGCAACG AAAACGGAAG 
n™™™ ACATT TGAAG TTGCACTTCA TTTAGGTGAT GATACAGTT? 
oc GTACAGTTGC GATGTCTTCC ACAGATGGAC TTGTTCGTGG CACAGAActS 

25 gaagatactg gtaaagcaat ctctgtacca gttggtgatg SSacttS 
acgtgtattc aacgtattag gtgatgcaat tgacttaga? ggtSJ???? 

CTGCGGATGT ACACCGTGAT CCAATTCACC GTCAAGcS? T^CA^SX 

gaattatcta ctaaagtaga aattcttgaa actggtatta aa^IctSS 

S3SSS 5552X2* ™ccS JJgSSSS 



50 
100 
150 
200 
250 
300 

3 0 CCGGCGTAGG TAAAACAGTA ?55S?SSS SSSESS SSSggS ill 

500 
550 
600 

35 aggacaagac gtacttctgt tcatcgataa ca^cttccgS SSSSK ™° 
cgggttctga agtatctgcc cttcttggtc gtatgccatc ?gSS????? «2 
2£S£S£ cacttgcaac agaaatgggt caaSg 2SS3S5 SSS 

810 

40 



SSSSSS 00 gtggt ^ctc* tgtattcgct ggtgtaggtg SSSSSS 

!™™ T ^CTTATACC ACGAAATGAG CGATTCTGGC GTAaJcSS 
AAACTGCGAT GGTATTCGGA CAAATGAACG AGCCACCTGG AG^rrr??? 
CGTGTTGCAT TAACAGGTTT AACAATGGCT GAGCa55tc? GtgJSSSJ 

SESSSSE «™£ SSSSS2 



2 ) INFORMATION FOR SEQ ID NO: 249 

( i ) SEQUENCE CHARACTERISTICS • 
45 (A) LENGTH: 944 bases 

IB) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroidea distasonia 

(B) STRAIN : ATCC 8503 

5 5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249 
CTTTGATTAT GGGACAGAAG AAACAGTAAC CCTCCCCCPT jTnniPnn^ 

280 
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CGGATTGAGA CGAGGCATGG AGGCCGTGTC ATACGGAATG CCCATCACCA 
TGCCGACCGG CGACCAAGTC AAAGGACGTT JAATG TTCCCATCCA 
CCTATCGATG GCATGGCCCA GCTTACTAAA ^ ^ qaggTCTTGT 

TCGTGAGCCT CCTAAATTCG AGGATCTGAC AA caaagGAGGC 

5 acacgggtat taaagtaatc Jatttattgg ^^^ cgg TATTGAT CAT 

AAGATCGGAC TTTTCGGAGG AGCCGGAGTC c ^ G t TTC TCCGT CTTCG 
GGAATTGATC AACAACATCG ^JAAAAA ^ GCGTGAAATG 
CCGGTGTGGG TGAACGTACA CGTGAAGGAA AG AGTATGGA 

SSSmcco gtgtcatccg atacggcgag gagtttaaga agA gctaagt 

10 GGCAGGCAAC TGGGACTTGT CGAAGATCGA Jlg^ ggGCGCCCGC 
CGCAGGCTAC CTTGGTATTC GGGCAGATGA JgHttC tccgtgacaa 
TCGTCCGTGG CCTTATCCGG TTTGACGAiL. £ GATAATATCT 
AGCCTCTGAG GGAGAAAGAA AAGATATATT ATTCT1UA GGGGCGTATG 
TCCGTTTCAC CCAAGCCGGT JCCGAGGTTT CCGCCTT^ TGGGA GCTAT 

15 GcSgAGCG? SSSS SSSSK SSSCATCACC TCCG 



200 

250 

300 

350 

400 

450 

500 

550 

600 

650 

700 

750 

800 

850 

900 

944 



20 2) INFORMATION FOR SEQ ID NO: 250 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 939 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 
Zb (D ) TOPOLOGY: Linear 



( i i ) MOLECULE TYPE: Genomic DNA 

(B) STRAIN: ATCC 8483 
(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 250 

35 TTGAGGGTAC ?5^ C ^^ A > G ^ G caaAATACTG GTTGTAGAAG TTCAGCAACA 
GAGATAAAGA GGCCAAACGG CAAAATACiu ^ QT ACTGA CGGAC 

TATCGGCGAA AATACGGTGC ^ACCGTAGC ^GGAUA^ 
TTCAGAGAGG CATGAAAGTG TATCCCACUG GTCG QTGATTCGAT 

ATTGGCGAAC AGATTAAAGG ACGACTGATG A TTCATCGCG 
CGACGGTATG AAAGGACTCG ^CGCAAAGG TGCATATTCC gctcttcaca 
AC CCCCCTAA GTTTGAGGAT TTGACTACTG Q QTGGTAAAAT 

GGTATCAAAG TGATCGACCT g£TCGAACCG 1 AT TCAGGAAC 
CGGTTTGTTC GGCGGTGCCG JTGTAGGAAA ^ AT TTGCCGGA 
TTATCAATAA TATCGCCAAG AAA^AATG ?? G CTGCGCG AAATGATTGA 
GTAGGTGAAC GTACCCGTGA AGGTAACGAC JJg^ A TGGAGAAAG 
ATCCGGTGTA ATCCGTTACG gCGAAGCATT jgAAGAA gAAATCGCAA 
GTCACTGGGA TCTTTCGAAA gTGGATTATA ACGAA q 

GTGTCTCTGA ttttcggtca gatgaacgag ^ gacgcaggaa 

SSSSS SSS3S HIE *™ SSSSSSS 

SS £|™ Sgjggg. ceo,™ 



40 



45 



50 



55 



60 



SS £Sccc£ £Uo5S™ tccctcco 

2) INFORMATION FOR SEQ ID NO: 251 
(i) SEQUENCE CHARACTERISTICS: 
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10 



(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leclercia adecarboxylata 

(B) STRAIN: ATCC 23216 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 51 



CGAATTCCCT 

15 AGAATGGTAA 
GGTATTGTGC 
TCTGGAAGTT 
CAACCCTGGG 
GGCGACATCG 

20 CTATGAAGAG 
TTATCGACCT 
GGCGGTGCGG 
CATCGCGATC 
GTACTCGTGA 

25 CTGGACAAAG 
CCGTCTGCGC 
ACGAAGGTCG 
CTGGCCGGTA 
AGGCTACCAG 

3 0 TCACCTCTAC 



CAGGATGCCG 

TGAGAGCCTG 

GTACCATCGC 

AAAGACCTCG 

CCGTATCATG 

GCGAAGAAGA 

CTGTCCAGCT 

GATGTGTCCG 

GTGTAGGTAA 

GAGCACTCCG 

GGGTAACGAC . 

TATCCCTGGT 

GTTGCGCTGA 

TGACGTACTG 

CGGAAGTATC 

CCGACCCTGG 

CAAAACCGGT 



TACCGCGCGT 

GTGCTGGAAG 

CATGGGTTCT 

AGCACCCAAT 

AACGTCCTGG 

GCGTTGGGCT 

CTCAGGAACT 

TTCGCGAAGG 

AACCGTAAAC 

GTTACTCCGT 

TTCTACCACG 

TTACGGCCAG 

CCGGCCTGAC 

CTGTTCGTTG 

CGCACTGCTG 

CGGAAGAGAT 

TCTATCACCT 



GTACGATGCT 
TTCAGCAGCA 
TCCGACGGTC 
CGAAGTACCA 
GTCAGCCGAT 
ATCCACCGTT 
GCTGGAAACC 
GCGGTAAAGT 
ATGATGGAGC 
GTTTGCAGGC 
AAATGACCGA 
ATGAACGAGC 
CATGGCTGAG 
ACAACATCTA 
GGTCGTATGC 
GGGTGTTCTG 
CCG 



CTTGAGGTTC 50 

GCTCGGCGGC 100 

TGCGTCGTGG 15 0 

GTAGGTAAAG 200 

CGACATGAAA 2 50 

CAGCACCTTC 3 00 

GGCATCAAAG 3 50 

TGGTCTGTTC 4 00 

TGATCCGTAA 450 

GTGGGTGAGC 5 00 

CTCCAACGTT 550 

CACCAGGAAA 6 00 

AAGTTCCGTG 650 

CCGTTACACC 70 0 

CATCAGCAGT 750 

CAGGAACGTA 8 00 
833 



35 



2) INFORMATION FOR SEQ ID NO: 2 52 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
4 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 ORGANISM: Stenotrophomonas maltophllia 

(B) STRAIN: CDC F3338 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252 

50 GTGTACGACG CACTGAAGGT GGAAAACACC GAGATCACCC TCGAAGTCCA 
GCAGCAGCTG GGCGACGGCG TGGTGCGTAC CATCGCCCTC GGTTCCACCG 
ACGGCCTGAA GCGCAACCTG GTTGCCGTCA ACACCGGCCG TGGCATCTCG 
SJ™S^ G GCGCCGGCAC CCTGGGCCGC ATCATGGACG TGCTGGGCCG 
„ ?£CGATCGAC GAAGCCGGCC CGGTCGCTGC CAGCGACAAC TGGGAAATCC 250 
bb SKSKX ^GTCGTAT GAAGACCAGT CCCCGGCCAC cSSSESSctS 



GAAACCGGCA TCAAGGTCAT CGACCTGATG TGCCCGTTCG CCAAGGGCGG 
CAAGGTCGGC CTGTTCGGCG GCGCCGGCGT CGGCAAGACC gSJSSSS 
SXS^ GCCAAGGCGC ACAGCGGCCT S?S5SSS5 



50 
100 
150 
200 



300 
350 

GCCGGCGTGG CTGAGCGTAC CCGTGAGGGC 3KSSSSS Ac'S2gIS? ill 
60 GAAGGACTCC AACGTCCTGG ACAAGGTGGC GATGGTGTA? GGCC^S?^ 



550 
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45 



ssssas 

CTCTATTACC TCTG 

2) INFORMATION FOR SEQ ID NO: 254 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 866 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

lVil <£f "SSaKE?' Bifidobacteria,, adoiescantis 



60 



(B) STRAIN: ATCC 15703 

283 



600 
650 
700 
750 



ACGAGCCGCC SGOCAACCGT CTGC3CGTCG ™££ gggSSS? 

sss sssss s SSSSSS SSESSSS 

TGGTTCGATC ACCTCGATC 

10 2) INFORMATION FOR SEQ ID NO: 253 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 864 bases 

(B) TYPE: Nucleic acid 
(c) STRANDEDNESS: Double 

15 (D ) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

» ( "'^ GIN ^Sm : Bar t o nS 21a henselae 
( B ) STRAIN: ATCC 49882 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253 
25 AGTTTGAAGG CCCTTTGCCA JATATTCTCA ATGCATTAGA AACAGATAAT 
TTGGGCAATC GGCTAGTTTT AGAAGTTGCT CAGCA CGTGGTCAAA 
CGTGCGTACC ATTGCCATGG ^ACTACCGA TGGTCTTG^ AGAAGCAACA 
AAGTTTTTGA TACAGGAACA gA^CAGTG T ^ A TGTTGGCCC 

30 CTTGGTCGTA TTATGAATGT ®J™GAGMJ u CCTGAATATG 
AATTGCTACA AGCAAAACCC GTTCCATTCA C T XA AAGTCGTT 

TGGAGCAATC AACCGCATCA GAAATCC11U i TGTTTGGAGG 
GATCTGTTAG CTCCTTATTC TAAAGGGGGG ^TTG^ 
TGCCGGTGTT GGTAAAACCG TTCTCATTAi ^ TGAACGTACA 
35 CAAAGGCGCA TGGTGGCTAT TCAGTGTTTG CCGGTGTT^ GTGTGA ATGT 
CGTGAGGGAA ATGATCTTTA JTATGAAATG ATCGAAAG^ TGTGCACTCG 
GAATCCAAAA GACAACAATG GTTC^CAGA CGCGTGCACG TGTGGCTCTT 
TTTATGGGCA AATGAATGAA CCACCAG^ j. AC AAGATGTTTT 
TCAGGATTGA CCATTGCAGA AAGTTTCCGT qcTGAAGTGT 
4 0 GTTCTTCGTA GATAATATTT TCCGTTTTA^ A GCCA ACTTTG 

CAGCTCTTTT AGGGCGTATT CCTTCTGCTG TA caagaACAGG 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254 

TTCCCGGTGG GCCACCTGCC CGACATTTAC AATGCACTTA CCGTTGAACT 5 0 

GKCCAACACC GGYGTCCACG AASAGGGCGA GACCACCAAG AAGATCACCC 100 

5 TTGMGGTTGA ACAGCATCTT GGCGATTCCA CCGTGCGTAC CGTCGCTCTG 15 0 

AAGCCGACTG ACGGCCTTGT GCGTGGCGCC ACCGTGTATG ACACCGGCGG 2 00 

CCCGATCTCT GTGCCGGTTG GCGATGTCAC CAAGGGC C AC GTATTCGACG 2 50 

TGTCCGGCAA CATCCTCAAC AAGAAGGCCG ACGAGACCGT TAAGGTTACC 3 00 

GAACGTTGGT CTATCCACCG TAACCCGCCG GCATTCGACC AGCTGGAGTC 3 50 

10 CAAGACCCAG ATGTTCGAAA CCGGTATCAA GGTCATCGAT TTGCTGACCC 4 00 

CGTATGTGCA GGGCGGCAAG ATCGGTCTGT TCGGCGGCGC AGGCGTCGGC 450 

AAGACCGTGC TGATCCAGGA AATGATTCAG CGTGTGGCTC AGAACCACGG 500 

CGGTGTGTCC GTGTTCGCAG GCGTCGGCGA GCGTACCCGT GAGGGTAACG 550 

ATCTGATCGG CGAAATGGAC GAAGCCGGCG TGCTCGAGAA GACCGCACTG 600 

15 GTCTTCGGCC AGATGGATGA GCAGCCGGGT ACCCGTCTGC GCGTGCCGCT 65 0 

GACCGCACTG AC C ATGGC AG AGTACTTCCG TGACGTACAG AATCAGGACG 700 

TGCTGCTGTT CATCGATAAC ATCTTCCGTT TCACCCAGGC TGGTTCCGAG 750 

GTGTCCACCC TGCTCGGCCG TATGCCGTCC GCAGTGGGCT ACCAGCCGAA 8 00 

CCTGGCCGAT GAGATGGGCG CGCTGCAGGA GCGAATCACT TCGACCCGTG 85 0 

2 0 GACACTCCAT CACCTC " 866 



25 



2) INFORMATION FOR SEQ ID NO: 255 



50 



(i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 842 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
3 5 (A) ORGANISM: Brucella abortus 

(B) STRAIN: S2308 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255 

4 0 AAGGCCAGCT GCCGCTGATC CTGAACGCGC TTGAAGTGGA CAATCAGGGC 

CATCGTCTGG TTCTCGAAGT TGCCCAGCAC CTCGGCGAAG ACACCGTGCG 10 0 

CACCATCGCC ATGGACGCGA CCGAAGGTCT CGTTCGCGGT CAGGAAGCAC 150 

GCGACACTGG CGAACCGATC ATGGTGCCGG TCGGCGTCGA AACGCTTGGC 2 00 

CGCATCATGA ACGTCATCGG CGAGCCGGTT GACGAAG CAG GCCCCATCAA 2 50 

4 5 GACCAAGGCA ACCCGCGCCA TCCACCAGAA CGCGCCGGAA TATATCGAAC 3 00 

AGTCGACCGA AGCCGAAATT CTGGTCACGG GCATCAAGGT CGTCGACCTT 3 50 

CTGGCGCCTT ACGCCAAGGG CGGCAAGATC GGCCTCTTCG G CGGTGCAGG 4 00 

CGTCGGCAAG ACCGTTCTCA TCATGGAACT CATCAACAAC GTCGCCAAGG 4 50 

CGCACGGCGG TTATTCCGTG TTCGCAGGCG TCGGTGAGCG TACCCGTGAG 5 00 

50 GGCAACGACC TTTACCACGA AATGATCGAG TCGGGCGTGA ACAAGCTCGG 550 

CGGCGGCGAA GGCTCCAAGG CAGCCCTCGT TTACGGCCAG ATGAACGAAC 6 00 

CCCCGGGTGC CCGCGCCCGC GTTGCCCTTT CCGGTCTGAC GGTTGCTGAA 6 50 

AACTTCCGTG ACCAGGGCCA GGACGTTCTG TTCTTCGTGG ACAACATCTT 700 
CCGCTTCACG CAGGCAGGTT CGGAAGTGTC GGCTCTTCTC GGCCGTATTC 

" GGGTTATCAG CCGACGCTGG CAACCGACAT SScGCCATG 800 

CAGGAACGCA TCACCACGAC GACCAAGGGT TCGATCACCT CG 842 



60 2) INFORMATION FOR SEQ ID NO: 256 
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( i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 833 bases 
in) TYPE: Nucleic acid 
( C ) STRANDEDNESS : Double 
5 ( D ) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

1 ( A ) ORGANISM: Cedecea davisae 
(B) STRAIN: ATCC 33431 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256 
15 CGAGTTCCCT CAGGACGGCG TACCGCGCGT TKATGATGCG CTTGAAGTAC 
AAAATAACAG CGAGCAGCTG GJGCTGGAAG XGCGTCGTGG 
GGTATCGTGC GTACCATCGC JATGGGTTCT TCCGA q q^qq T aAAG 
TCTGGAAGTT AAAGACCTCG JGJACCCGAl £ T CGAC ATGAAA 

2 0 CGACCCTTGG CCGTATCATG AAgGTGCTGG ^CA^ CTGCA CCTTC 

GGCGATATCG GCGAAGAAGA CCGTTGGGCT Ai c GGCA TCAAAG 

CTATGAAGAG CTGTCCAGCT CTCAGGAACT ^ T AGGTCTGTTC 

TAATCGACCT TATCTGTCCG TTCGCTAAGG ATGATGGAGC TTATCCGTAA 
GGTGGTGCGG GCGTGGGTAA ^GTAAAC AT ^ GTGGGTGAG C 
2 5 CATCGCGATC GAGCACTCCG GCTACTCCGT ^ QA CTCC AACGTT 

GTACTCGTGA GGGTAACGAC ^CTATCAUb ATGAACGAGC CACCGGGTAA 
CTGGACAAAG TTGCCCTGGT TTACGGCCAG £1 AAATTCCGTG 
CCGTCTGCGC GTAGCGCTGA CCGGTCTGAC CCGTTACACC 
ACGAAGGTCG TGACGTTCTG CTGTTCGTTG JTAACATCTA cTTCTGCGGT 

30 SSSSS S3SES gggtgttctt caggagcgta 

ca^accggt tccatcacct ccg 

35 2) INFORMATION FOR SEQ ID NO: 257 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases 

( B ) TYPE: Nucleic acid 
40 (C ) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



50 



55 



60 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: _ . 

(A) ORGANISM: Cedecea lapagei 

(B) STRAIN: ATCC 33432 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257 

TTCCCTCAGG ACGGCGTACC ? CGCGTATAT OACGCGCTTG ^TACAGAA 
TAACAGCGAG AAGCTGGTGC TGGAAGTTCA JCAGCAGCTC TCTG 

TCGTACGTAC CATCGCAATG GGJTCTTCCG ACGGlUl^ G TAAAGCGAC 
GAAGTGAAAG ACCTCGAGCA CCCGATCGAA gJgJC^AG G 

TCTGGGTCGT ATCATGAACG TGCTGGGTCA ACCTTCCTAT 

ACATCGGCGA AGAAGATCGT TGGGCGATTC tCAAAGTTAT 

if™ S =i ssass ssss 
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10 



25 



55 



GCGATCGAGC ACTCCGGTTA CTCCGTGTTT GCAGGCGTGG GTGAGCGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAGAT GACCGACTCC AACGTTCTGG 550 

ACAAAGTTGC ACTGGTTTAC GGCCAGATGA ACGAGCCGCC AGGTAACCGT 600 

CTGCGC GTAG CGCTGACCGG TCTGACCATC GCGGAGAAAT TCCGTGACGA 650 

AGGCCGTGAC GTTCTGCTGT TCGTCGATAA CATCTATCGT TATACCCTGG 700 

CCGGTACAGA AGTTTCTGCA CTGCTGGGTC GTATGCCATC TGCGGTAGGT 7 50 

TATCAGCCTA CTCTGGCAGA AGAGATGGGT GTTCTTCAGG AGCGTATTAC 8 00 

CTCCACCAAG ACCGGTTCCA TCACTTCCG ~ "" ~ 82 9 



2 ) INFORMATION FOR SEQ ID NO: 258 



( i ) SEQUENCE CHARACTERISTICS : 
15 (A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

2 0 ( i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cedecea neteri 

(B) STRAIN: ATCC 33855 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258 



650 
700 
750 



TCCCTCAGGA CGGCGTACCG CGCGTTTATG ACGCGCTTGA GGTACAGAAC 5 0 

AACAATGAGA AGCTGGTGCT GGAAGTTCAG CAGCAGCTCG GCGGCGGTAT 10 0 

30 CGTGCGTACC ATCGCAATGG GTTCTTCCGA CGGTCTGCGT CGTGGTCTGG 1«5 0 

TAGTAACAGA CCTCGAGCAC CCGATCGAAG TCCCGGTAGG TAAAGCGACC 200 

CTTGGCCGTA TCATGAACGT GCTGGGT C AG CCGATCGACA TGAAAGGCGA 250 

CATCGGCGAA GAAGACCGTT GGGCGATTCA CCGCGCAGCA CCTTCCTACG 100 

AAGAGCTGTC CAGCTCTCAG GAATTGCTGG AAACCGGCAT CAAAGTTATC 3^0 

35 GACCTGATTT GTCCGTTCGC TAAGGGCGGT AAAGTAGGTC TGTTCGGTGG 4 00 

TGCGGGCGTA GGTAAAACCG TAAACATGAT GGAGCTGATC CGTAACATTG 4 50 

CGATCGAGCA CTCCGGTTAT TCCGTGTTTG CGGGCGTGGG TGAGCGTACT 500 
CGTGAGGGTA ACGACTTCTA CCACGAAATG ACCGACTCCA ACGTTCTGGA 

TAAAGTAGCA CTGGTTTACG GCCAGATGAA CGAGCCACCA GGTAACCGTC 60 0 

4 0 TGCGCGTAGC GCTGACCGGT CTGACCATCG CGGAAAAATT CCGTGACGAA 
GGCCGCGACG TTTTGCTGTT CGTTGATAAC ATCTACCGTT ATACCCTGGC 
CGGTACCGAA GTATCTGCAC TGCTGGGTCG TATGCCTTCT GCGGTAGGTT 

ATCAGCCAAC TCTGGCAGAA GAGATGGGTG TTCTTCAGGA GCGTATTACC Ron 

TCCACCAAGA CCGGTTCTAT CACCTCCGTA ^lAlTACC 800 

45 ojU 

2) INFORMATION FOR SEQ ID NO: 259 

50 (i) SEQUENCE CHARACTERISTICS: ' 

(A) LENGTH: 931 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Chryseobacterium meningosepticum 
SO (B) STRAIN: CDC B7681 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259 

TTTCAGGAAG TAGAGGAACT ACCAAACATT TATGACGCAC TTGAAGTTGT 50 

5 CAGAGAAGGC CAGAAAGGTC TGATCTTAGA AGTTGAACAA CACATCGGTG 100 

AGGATACAGT GAGATGTATC GCTATGGATG CAACAGACGG ACTTCAAAGA 150 

GGACAATCTG TAATAGGACA TGGAAGACAA ATTACTATGC CTATTGGTGA 20 0 

AGAAGTAAAT GGTAGATTGT TCAACGTGGT AGGAGATGCT ATCGACGGAC 250 

TTCAGGAATT ATCTAAAGAC AATGGTTTGC CAATC CACAG AGAAGCACCT 3 00 

10 AAATTCGATC AGTTATCAAC TTCTGCTGAA GTACTATATA CAGGTATCAA 350 

AGTAAT CGAC CTTAT CGAGC CTTATGCAAA AGGTGGTAAA ATTGGTTTGT 400 

TCGGTGGTGC TGGTGTAGGT AAAACAGTAT TGATCCAGGA ATTGATTAAT 450 

AACATTGCTA AAGGACACGG TGGTCTTTCT GTATTCGCAG GAGTAGGTGA 50 0 

GAGAACAAGA GAAGGAAATG ACCTTCTTCG TGAGATGTTA GAGTCTGGTA 550 

15 TTATTAAATA TGGTGACGAA TTCATGCATT CTATGGAGAA CGGTGGATGG 60 0 

GATCTTTCTA AAGTTGACAG TGAGTTGATG AAAGAGTCTA AAGCTGCTTT 650 

CGTTTTCGGA CAGATGAACG AGCCACCAGG TGCAAGAGCA CGTGTAGCCC 700 

TTTCTGGTCT TACTTTAGCT GAAT AC T AC C GTGATGGTGG CGAAAGCGGA 750 

CAAGGTAGAG ACGTTCTTTT CTTCGTAGAC AACATCTTCC GTTTTACACA 800 

20 GGCTGGTTCT GAGGTGTCTG CACTTCTAGG TCGTATGCCT TCAGCGGTAG 8 50 

" GTTACCAACC AACTCTAGCT TCTGAGATGG GTGCAATGCA GGAGAGAATT 900 

ACTTCAACTA AAAACGGATC TATTACATCT G 931 



25 



35 



2) INFORMATION FOR SEQ ID NO: 260 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 726 bases 
3 0 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter amalonatlcus 

(B) STRAIN: ATCC 25405 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260 

GTATCGTACG TACCATCGCA ATGGGTTCTT CCGACGGTCT GCGTCGTGGT 50 

CTGGAAGTCA AAGAC CTCGA GCACCCGATC GAAGTCCCGG TAGGTAAAGC 100 

AACCCTGGGT CGTATCATGA ACGTCCTGGG TCACCCGATC GACATGAAAG 15 0 

4 5 GCGATATCGG TGAAGAAGAG CGTTGGGCTA TCCACCGCGC AGCACCGTCC 20 0 

TATGAAGAGC TGTCCAGCTC TCAGGAACTG CTGGAAACCG GTATCAAAGT 250 

TAT CGACCTG ATGTGTCCGT TCGCGAASGG CGGTAAAGTG GGTCTGTTCG 3 00 

GTGGTGCGGG TGTAGGTAAA ACCGTAAACA TGATGGAGCT TATCCGTAAC 3 50 

ATCGCGATCG AGCACTCCGG TTACTCCGTG TTTGCGGGCG TAGGTGAACG 4 00 

50 TAC TCGTGAG GGTAACGACT TCTACCACGA AATGACCGAC TCCAACGTTC 4 50 

TGGATAAAGT ATCCCTGGTA TATGGCCAGA TGAACGAGCC GCCGGGAAAC 500 

CGTCTGCGCG TTGCACTGAC CGGTCTGACC ATGGCAGAGA AATTCCGTGA 55 0 

CGAAGGTCGT GAC GTACTGC TGTTCGTCGA TAACATCTAT CGTTACACCC 60 0 

TGGCCGGTAC GGAAGTTTCC GCACTGCTGG GTCGTATGCC ATCAGCGGTA 650 

55 GGTTACCAGC CGACCCTGGC GGAAGAGATG GGTGTTCTGC AGGAACGTAT 700 

CACTTCTACC AAAACCGGTT CTATCA 72 6 



60 2) INFORMATION FOR SEQ ID NO: 261 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: CitroJbacter braakii 

(B) STRAIN: ATCC 43162 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261 



15 



GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTTATGAATG GTAAAGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTACGTACCA 100 

TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAT 150 

CTCGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACTC TGGGTCGTAT 2 00 

2 0 CATGAACGTC CTGGGTCACC CGATCGACAT GAAAGGCGAT ATCGGTGAAG 2 50 

AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CTTCCTATGA AGAGCTGTCC 3 00 

AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG AC CTGATGTG 3 50 

TCCGTTCGCT AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAACAC 450 

2 5 TCCGGTTACT CCGTGTTTGC GGGCGTGGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAT AAAGTATCCC 550 

TGGTATATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCT 600 

CTGACCGGTC TGACCATGGC AGAGAAGTTC CGTGACGAAG GTCGTGACGT 65 0 

TCTGCTGTTC GTTGATAACA TCTATCGTTA CACCCTGGCC GGTACAGAAG 700 

3 0 TATCCGCTCT GCTGGGTCGT ATGCCAT CAG CGGTAGGCTA CCAGCCGACC 750 

CTGGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CTACCAAAAC 800 

CGGTTCTATC AC 812 



35 



45 



2) INFORMATION FOR SEQ ID NO: 262 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 811 bases 
4 0 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citroibacter koseri 

(B) STRAIN: ATCC 27156 

50 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262 

GCGCGTGTAC GACGCCCTTG AGGTGCAGAA TGGTAATGAA CATCTGGTGC 50 

TGGAAGTT CA GCAGCAGCTC GGTGG CGGTA TCGTACGTAC CATCGCCATG 100 

GGTTCTTCCG ACGGCCTGCG TCGTGGTCTG GATGTGAAAG AC CTTGAGCA 150 

55 CCCGATCGAA GTCCCGGTAG GTAAAGCAAC GCTGGGTCGT ATCATGAACG 200 

TACTGGGCGA ACCAGTAGAC ATGAAAGGCG AGATCGGTGA AGAAGAGCGT 250 

TGGGCTATCC AC CGTGCGGC ACCGTCCTAC GAAGAGTTGT CAAACTCTCA 3 00 

GGAACTGCTG GAAAC CGGTA TCAAAGTTAT CGACCTGATG TGCCCGTTCG 3 50 

CGAAGGG CGG TAAAGTGGGT CTGTTCGGTG GTG CGGGTGT AGGTAAAACC 400 

60 GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAAC ACTCCGGTTA 450 
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CTCCGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT AACGACTTCT 500 

ACC AC GAAAT GACCGACTCC AACGTTATCG ACAAAGTATC CCTGGTTTAC 550 

GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG CGCTGACCGG 600 

CCTGACCATG GCGGAGAAAT TCCGTGACGA AGGTCGTGAC GTTCTGCTGT 650 

TCGTCGACAA CATCTACCGT TACACC CTGG CCGGTACGGA AGTATC CGC A 70 0 

CTGCTGGGTC GTATGCCTTC AGCGGTAGGT TACCAGCCGA CCCTGGCGGA 750 

AGAGATGGGT GTTTTGCAGG AACGTATCAC CTCCACCAAA ACCGGTTCTA 800 

TCACCTCCGT A 811 



2) INFORMATION FOR SEQ ID NO: 2 63 



( i ) SEQUENCE CHARACTERISTICS : 
15 (A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

2 0 (ii) MOLECULE TYPE: Genomic DNA 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: CitroJbacter farmer! 

(B) STRAIN: ATCC 51112 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263 



GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTTATGA ATGGTAAAGA 50 

GAGTCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGCGGT ATCGTACGTA 100 

3 0 CCATCGCGAT GGGTTCTTCC GACGGTCTGC GTCGTGGTCT GGAAGTAAAA 150 

GACCTCGAAC ACCCGATCGA AGTCCCGGTA GGTAAAGCAA CTCTGGGTCG 2 00 

TATCATGAAC GTCCTGGGTC ACCCGATCGA CATGAAAGGC GATATCGGTG 250 

AAGAAGAGCG TTGGGCTATC CACCGCGCAG CGCCATCCTA TGAAGAGCTG 3 00 

TCCAGCTCTC AGGAACTGCT GGAAACCGGT ATCAAAGTTA TCGACCTGAT 3 50 

3 5 GTGTCCGTTT GCGAAGGGCG GTAAAGTCGG TCTGTTCGGT GGTGCGGGTG 4 00 

TAGGTAAAAC CGTAAACATG ATGGAACTCA TCCGTAACAT CGCGATCGAG 45 0 

CACTCCGGTT ACTCCGTGTT TGCGGGCGTA GGTGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGATTC CAACGTTCTG GATAAAGTAT 550 

CCCTGGTATA TGGCCAGATG AACGAGCCGC CGGGAAACCG TCTGCGCGTT 60 0 

4 0 GCGCTGACCG GTCTGACCAT GGCAGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTACTGCTG TTCGTCGATA ACATCTATCG TTACACCCTG GCCGGTACGG 700 

AAGTATCCGC ACTGCTGGGC CGTATGCCAT CAGCGGTAGG CTACCAGCCA 750 

ACCCTGGCGG AAGAGATGGG TGTTCTGCAG GAACGTATCA CTT CTACCAA 8 00 

AACCGGTTCT ATTACC 816 

45 



2) INFORMATION FOR SEQ ID NO: 2 64 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE : 

(A) ORGANISM: CitroJbacter freundil 
60 (B) STRAIN: ATCC 8090 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264 



AGGATGCCGT 
5 GAGAGCCTGG 
TACCATCGCC 
AAGACCTCGA 
CGTATCATGA 
TGAAGAAGAG 

1 0 TGTCAAGCTC 
ATGTGTCCGT 
TGTAGGTAAA 
AACACTCCGG 
GGTAACGACT 

15 ATCCCTGGTA 
TTGCGCTGAC 
GACGTTCTGC 
AGAAGTATCT 
CGACCCTGGC 

2 0 AAAACCGGTT 



ACCGCGCGTG 
TGCTGGAAGT 
ATGGGTTCTT 
GCACCCGATC 
ACGTTCTGGG 
CGTTGGGCTA 
TCAGGAACTG 
TCGCTAAGGG 
AC CGTAAACA 
TTACTCCGTG 
TCTACCACGA 
TATGGCCAGA 
CGGTCTGACC 
TGTTCGTTGA 
GCACTGCTGG 
GGAAGAGATG 
CTATCACCT 



TACGATGCTC 
TCAGCAGCAG 
CTGACGGTCT 
GAAGTCCCGG 
TCACCCGATC 
TCCACCGTGC 
CTGGAAACCG 
CGGTAAAGTT 
TGATGGAGCT 
TTTGCGGGCG 
AATGAC CGAC 
TGAACGAGCC 
ATGGCTGAGA 
TAACATCTAT 
GTCGTATGCC 
GGTGTTCTGC 



TTGAGGTTAT 
CTCGGCGGCG 
GCGTCGTGGT 
TAGGTAAAGC 
GACATGAAAG 
AGCACCTTCC 
GTATCAAAGT 
GGTCTGTTCG 
GATCCGTAAC 
TAGGTGAACG 
TCCAACGTTC 
GCCTGGAAAC 
AGTTCCGTGA 
CGTTACACCC 
ATCAGCGGTA 
AGGAACGTAT 



GAATGGTAAA 5 0 

GTATCGTACG 100 

CTGGAAGTTA 150 

AACGCTGGGT 200 

GCGATATCGG 250 

TACGAAGAGC 3 00 

TATCGACCTG 350 

GTGGTGCGGG 4 00 

ATCGCGATCG 450 

T ACT CGTGAG 500 

TGGACAAAGT 550 

CGTCTGCGTG 600 

CGAAGGTCGT 650 

TGGCCGGTAC 700 

GGCTACCAGC 750 

CACCTCCACC 80 0 
819 



25 



2) INFORMATION FOR SEQ ID NO: 265 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Cltrobacter koseri 

(B) STRAIN: ATCC 27028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265 

4 0 GATGCCGTAC CGCGCGTGTA CGACGCCCTT GAGGTGCAGA ATGGTAATGA 50 

ACATCTGGTG CTGGAAGTTC AGCAGCAGCT CGGTGGCGGT ATCGTACGTA 100 

CCATCGCCAT GGGTTCTTCC GACGGCCTGC GTCGTGGTCT GGATGTGAAA 150 

GACCTTGAGC ACCCGATCGA AGTCCCGGTA GGTAAAGCAA CGCTGGGTCG 2 00 

TATCATGAAC GTACTGGGCG AACCAGTAGA CATGAAAGGC GAGATCGGTG 250 

4 5 AAGAAGAGCG TTGGGCTATC CACCGTGCGG CACCGTCCTA CGAAGAGTTG 3 00 

TCAAACTCTC AGGAACTGTT GGAAACCGGT ATCAAAGTTA TCGACCTGAT 350 

GTGTCCGTTC GCGAAGGGCG GTAAAGTGGG TCTGTTCGGT GGTGCGGGTG 4 00 

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGAT CGAA 450 

CACTCCGGTT ACTC CGTGTT TGCGGGCGTA GGTGAACGTA CTCGTGAGGG 500 

50 TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTATC GACAAAGTAT 55 0 

CCTTGGTTTA CGGCCAGATG AACGAGCCGC CGGGAAACCG TCTGCGCGTT 60 0 

GCGCTGACCG GCCTGACCAT GGCGGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTTCTGCTG TTCGTCGACA ACATCTACCG TTACACCCTG GCCGGTACGG 700 

AAGTATCCGC ACTG CTGGGT CGTATGCCTT CAGCGGTAGG TTACCAGCCG 750 

55 ACCCTGGCGG AAGAGATGGG TGTTTTGCAG GAACGTATCA CCTCCACCAA 800 

AACCGGTTCT ATCACCTCCG TA 822 



60 2) INFORMATION FOR SEQ ID NO: 266 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: CitroJbacter sedlakii 

(B) STRAIN: ATCC 51115 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266 

GATGC CGTAC CGCGCGTGTA CGACGCCCTT GAGGTACAGA 
GCGTCTGGTG CTGGAAGTTC AGCAGCAGCT CGGTGGCGGT 
CCATCGCCAT GGGTTCTTCC GACGGTCTGC GTCGTGGTCT 
GACCTTGAGC ACCCGATCGA AGTCCCGGTA GGTAAAGCAA 
TATCATGAAC GTACTGGGCG AACCAGTAGA CATGAAAGGC 
AAGAAGAGCG TTGGGCTATC CACCGTGCCG CGCCGTCCTA 
TCTAACTCTC AGGAACTGCT GGAAACCGGC ATCAAAGTTA 
GTGTCCGTTC GCGAAGGGCG GTAAAGTCGG TCTGTTCGGT 
TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT 
CACTCCGGTT ACTCTGTGTT TGCGGGCGTG GGTGAACGTA 
TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTATC 
CCCTGGTGTA CGGCCAGATG AACGAGCCGC CTGGAAACCG 
GCACTGACCG GTCTGACCAT GGCTGAGAAG TTCCGTGACG 
CGTTCTGCTG TTCGTCGATA ACATCTATCG TTACACCCTG 
AAGTATCCGC ACTGCTGGGT CGTATGCCTT CAGCGGTAGG 
ACTCTGGCGG AAGAGATGGG TGTTCTGCAG GAACGTATCA 
AACCGGTTCT ATCACCTCCG 



ATGGTAATGA 50 

ATTGTACGTA 100 

GGAAGTAAAA 150 

CGCTGGGTCG 2 00 

GACATCGGTG 2 50 

TGAAGAGTTG 3 00 

TCGACCTGAT 350 

GGTGCGGGCG 4 00 

CGCGATCGAG 4 50 

CTCGTGAGGG 500 

GACAAAGTAT 550 

TCTGCGCGTC 600 

AAGGTCGTGA 650 

GCCGGTACGG 700 

TTATCAGCCG 750 

CCTCAACCAA 80 0 
820 



35 



45 



2) INFORMATION FOR SEQ ID NO: 2 67 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 806 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: CitroJbacter werkmanil 

(B) STRAIN: ATCC 51114 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267 

GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTTATGAATG GTAAAGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTACGTACCA 100 

TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAC 150 

55 CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACCC TGGGTCGTAT 200 

CATGAACGTC CTGGGTCATC CGATCGACAT GAAAGGCGAT ATCGGTGAAG 250 

AAGAGCGTTG GGCTATCCAC CGCGCAGCAC CTACCTATGA AGAACTGTCC 3 00 

AGTTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCG AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

60 GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAACAC 450 
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TCCGGTTACT CAGTGTTTGC 
CGACTTCTAC CACGAAATGA 
TGGTATATGG CCAGATGAAC 
CTGACCGGTC TGACCATGGC 
5 TCTGCTGTTC GTTGATAACA 
TATCTGCACT GCTGGGTCGT 
CTGGCGGAAG AGATGGGTGT 
CGGTTC 



GGGCGTTGGT GAACGTACTC 
CCGACTCCAA CGTTCTGGAC 
GAGCCGCCGG GAAACCGTCT 
TGAGAAGTTC CGTGACGAAG 
TCTATCGTTA CACCCTGGCC 
ATGCCATCAG CGGTAGGCTA 
TCTGCAGGAA CGTATCACCT 



GTGAGGGTAA 500 

AAAGTATCCC 550 

GCGCGTTGCG 600 

GTCGTGACGT 650 

GGTACTGAAG 70 0 

CCAGCCAACC 750 

CTACCAAAAC 800 
806 



2 ) INFORMATION FOR SEQ ID NO: 268 

( i ) SEQUENCE CHARACTERISTICS : 
15 (A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

2 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ci trobacter youngae 

(B) STRAIN: ATCC 29935 

25 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268 

GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTTATGAATG GTAAAGAGAG 
CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTAT? GTAcS?a2cS 

3 0 tcgccatggg ttcttccgac ggtctgcgtc gtggtctgga agttaaagac 

ctcgagcacc cgatcgaagt cccggtaggt aaagcaacgc tgggtcS?a5 
catgaacgtt ctgggtcacc cgatcgacat gaaaggcgat atcggagaag 
aagagcgttg ggctattcac cgcgcagcac cttcctatga agagSStc? 

AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGA?g5S 

35 **52S£2™ *?2™2!« ?ttcgg?gS? Scgggtg?;? 2SS 

450 
500 
550 
600 
650 
700 
750 



50 
100 
150 
200 
250 
300 



GTAAAACCGT AAACATGATG SESSSia GTAaStSc GAtIgISS? 
TCCGGTTACT CTGTGTTTGC GGGTGTAGGT GAACGTACTC GTGAGGGTAA 
CGACTTCTAC CACGAAATGA CCGATTCCAA CGTTCTGGAT AAa3?a?S£ 
TGGTTTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GC^TTCCG 

4 0 CTGACCGGTC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGAcS 
ACTGCTGTTC GTCGATAACA TCTATCGTTA TACCCTGGCC GGTACGGAAG 
TATCCGCACT GCTGGGTCGT ATGCCATCAG CGGTAGGTTA CCAGCCGaIS 
SSSS^ AGATGGGTGT TCTGCAGGAA CGTATCaSt" SSSSSS 

45 810 



2) INFORMATION FOR SEQ ID NO: 269 

( i ) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium innocuum 

(B) STRAIN: ATCC 14501 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 69 

TTGAGAACGG AGATTTGCCG CAGC TATTGA CCGCTATTGA AATTCCTCTG 50 

5 AAAGACAGTG AATCTCTGAT TGTCGAAGTT GCTCAGCATA TCGGTGATGA 100 

ACGTGTCCGC TGTATCGCTA TGGGCGGTAC AGATGGTCTG GTTCGTGGAA 150 

TGGAAGC CAT TGATACAGGA TCCGCAATCC GTGTACCGGT GGGAAAAGAA 2 00 

ATTCTGGGAA GAATGTTCAA TGTCCTCGGA CGTGAAATTG ATGGTCTGGG 2 50 

ACCTGTAGGA ACGGATAACA CACTGCCGAT CCACAGACAG GCACCGGGCT 3 00 

10 TTGAGGAGCA GCAGACATCC GCAGAAATGC TGGAAACAGG AATTAAGGTC 3 50 

ATTGACCTGT TATGTCCATA TTCCAAGGGT GGTAAGATTG GTTTGTTTGG 4 00 

TGGTGCGGGA GTAGGTAAAA CCGTACTGAT TCAGGAGCTG ATTCATAATA 4 50 

TCGCCAAGGA ACATGGTGGA ATGTCCGTCG TTACCGGTGT AGGGGAGAGA 5 00 

ACCCGTGAAG GAAACGACAT GTATCATGAA ATGAAGGACA GCGGTGTCCT 550 

15 TGATAAGACC GTACTGGTTT ACGGACAGAT GAATGAATCA CCGGGTGCCA 6 00 

GAATGCGTGT CGGTCTGACC GGGCTGACGA TGGCGGAATA TTTCCGTGAT 650 

CACGACCATC AGGATGTATT GCTGTTTATT GATAATATTT TCCGTTTTAC 70 0 

CCAGGCGGGA AGTGAAGTAA GTGCCCTGCT GGGACGTATG CCAAGTGCAG 750 

TAGGCTATCA GCCGACACTT GCGACAGAAA TGGGACAGCT GCAGGAGCGC 800 

2 0 ATTACATCCA CGAAGGATGG TTCCATT 82 7 



2) INFORMATION FOR SEQ ID NO: 270 



25 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

3 5 (A) ORGANISM: Clostridium perfringens 

(B) STRAIN: ATCC 13124 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270 

4 0 GCAAATGATG AACTTCCTAA TATATTTAAT GCAATACACA TAAAAATGGA 50 

CGATGGAAAA ATCTTAGTTT GTGAGGTAGA GCAACACGTA GGAGACGATA 100 

TAGTTAGAAC TATAGCTATG GAAGCTACTG AAGGACTAAG AAGAGGTGTA 150 

GAAGCTGTTG ATACAGGAGC ACCTATATCA GTACCAGTTG GTGAATGCGT 2 00 

ATTAGGAAGA ATATTTAACG TATTAGGTAA ACCACTAGAT AGTGGAGCTG 2 50 

4 5 AAGTTAATAA CGAAGAAAAA TATCCAATTC ATAGACCAGC TCCATCATTT 3 00 

GAAGAACAAT CAGTTGTTCC TCAAATGTTT GAGACAGGAA TAAAGGTTAT 3 50 

CGACCTTTTA GCACCTTACC AAAGAGGGGG AAAAATCGGT CTATTTGGAG 4 00 

GTGCAGGTGT TGGTAAAACA GTTCTTATCC AAGAGCTTAT AAACAACATA 4 50 

GCTAAAGAGC ACGGTGGACT TTCTGTATTC ACAGGAGTTG GAGAAAGATC 50 0 

50 AAGAGAAGGT AATGACCTTT ACTATGAAAT GATGGAATCA GGAGTTATAA 550 

AAAATACAGC ATTAGTATTT GGACAAATGA ACGAACCACC TGGAGCAAGA 600 

ATGAGAGTTG CTTTAACAGG ACTTACTATG GCTGAGTACT TCAGAGAC CA 650 

AGGT CAAGAC GTGTTATTAT TCATAGATAA CATATTCAGA TTCTCACAAG 70 0 

CTGGATCAGA GGTTTCAGCT TTATTAGGAA GAATAC CATC AGCTGTTGGT 75 0 

55 TACCAACCAA CTCTTGCTAC AGAGATGGGA GCTCTTCAAG AGAGAATCAC 800 

ATCAACTACC CATGGATCAA TTACATCAG 82 9 



60 2 ) INFORMATION FOR SEQ ID NO: 2 71 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271 

TTGTCCACGT TGGATRTCTT CA 

15 

2) INFORMATION FOR SEQ ID NO: 272 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 818 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium diphtheriae 

(B) STRAIN: ATCC 27010 

JO (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272 



2) INFORMATION FOR SEQ ID NO: 273 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



45 



40 



35 




50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
818 
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(vi) ORIGINAL SOURCE: , , . . _ 

(A ) ORGANISM: Corynebacterium pseudodxphthenticum 

(B) STRAIN: ATCC 10700 

b (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273 

rcRTGCCTGC TCTGTACAAC GCGCTGACTG TCGAGGTCAC CCTCGAGGCA 50 
ScgccSga CCATCACGCT TGAGGTTGCA CAGCACCTCG GCGATAACCT 100 
10 GATCcS£c£ ATTGCgSgG CACCTACGGA CGGTCTCGTC CGTGGCGCTG 150 
SSatcga CACTGGTAAG CCAATTACTG TTCCCGTCGG CGATGCCGTC 
aaaggScaS TCTTCAATGC GCTCGGTGAG TGTTTGGACG AACCAGGATT 
SSS?ScGAC GGCGAACAGT GGGGAATCCA CCGCGATCCG CCACCATTCG 
atgcgctgga GGGCAAAACC GAGATTCTGG AGACTGGAAT caaggttatc 

AlbLbLi^n ™ ^^^^Tn~nr* t\ t\t\ ATTP_nTP TrJTTPGGTGG 



200 
250 
300 

15 SScCTTt Scc™OT TAAGGGTGGC AAAATTGGTC TGTTCGGTGG 400 
15 rrrrrccllZ GGCAAGACCG TTCTTATCCA GGAAATGATC ACTCGTATCG 450 
CGCCGGCGTC CiOL-AA^uuo ±£ztL*™»M w ^r«! Tr.inmTRCC 500 

550 

,r Stgtctggt CTAAC^g" CTGAATATTT CCGCGACGTT til 
?AAAA2SS SSttSotT S?TCATTGAC AACATCTTCC GTTTTACTCA 700 
CAAAACCA^ m « m m^o PnnTnTP.rrT TPP^PPGTGG 750 

800 
833 



25 



CTCgSaCTT SHSSSSS TCCGTGTTCG CCGGCGTCGG TGAGCGTAC C 
CGTGAGGGTA CTGACCTGTT CCTGGAAATG GAAGAGATGG GCGTGTTGCA 
agac^c?gcc CTTGTCTTCG GTCAAATGGA CGAACCACCA GGGGTTCGTA 

AbALALLbC^- ^ X ~ « m ,,nn^oo PTnTiRTaTTT PPGPGACGTT 



GAGGTTTCCA CGC^Sg CCGTATGCCT TCCGCCGTGG 
GTTATCAGCC AACATTGGCT GATGAGATGG GTGTTTTGCA GGAACGGATT 
ACCTCTACAC GTGGTAAGTC AATTACTTCC CTG 

2) INFORMATION FOR SEQ ID NO: 274 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: CoryneJbacterxum ulcerana 

40 (B) STRAIN: NCTC 8665 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274 

PPPTGCTCAT CCAGGAGATG ATCACCCGTG TGGCCCGCAA CTTCGGCGGC 

Sotctgtqt TCGCCGGCGT CGGCGAGCGC ACCCGTGAGG GCAACGACCT 100 

?TCGG?2Sg ATGGACGAGG CCGACGTGCT CAAGGACACC GCCCTGGTGT 150 

SSScCAGAT GGACGAGC CG CCGGGAACCC GTCTGCGCGT GGCCCTGTCC 200 

SScSScCA TGGCGGAGTA CTTCCGCGAT GTGCAGAACC AGGACGTGCT 250 

SSSraTC GACAACATCT TCCGCTTCTC CCAGGCCGGC TCCGAGGTCT 300 

50 2S?StGCT GGGCCGCATG CCCTCCGCGG TGGGCTACCA GCCGAACCTG 350 

gSSSaSI TCGGTGTGCT GCAGGAGCGC ATCACCTCGA CTCGCGGCCA 400 
CTCCATCACC TCGATGC 



55 



60 



2) INFORMATION FOR SEQ ID NO: 275 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 



50 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium urealyticum 

(B) STRAIN: ATCC 43 042 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275 

GGGCAGCAGC CAGCACTATT CAACGCGCTG CACGTCGAGG TTGACCTCGA c n 

GGCAGTTGCG AAGACCATTA CCCTGGAGGT CGCACAGCAC CTGGGTGACA 1 on 

ACCTGGTGCG CACCGTCTCC ATGGCCCCGA CCGACGGCCT GGTCCGCG^ 150 
15 GCAGAGGTCA AGGACACCGG TAAGCCGATC TCTGTGCCAG TCGGCGATGT 



200 
250 

CATCGACCTG CT^CCCCTT ACGTCAAGGG C« SScSSSSS E 
20 *^ CC ™ TTCAGGAGAT SSS 450 

500 

CGTATGCGCG TGGCTCTGTC 555fS5ES ATGGCGGAGT S55£33£ S§ 

25 TGTTCAGGGC caggacgtgc tgctgttcat cgacaacatc ttccgtttca 5 o 

CCCAGGCAGG TTCTGAGGTC TCCACGCTGC TCGGCCGCAT GCCGTCCGCA* 7^n 

GTGGGTTACC AGCCGACCCT GGCTGACGAG ATGGGTGTTC TGCAGGAGCG son 

CATTACCTCC ACGAAGGGTA AGTCCATTAC CTCCC TGCAGGAGCG 800 

30 



^-^ \_ X 1 _ VjV^. ^ f-\ \ 1 I I I w ( -t{ VJj\ J. C 7 

TGTCAAGGGG CACGTCTTCA ACGCCCTGGG CGACTGCCTG GATGAGCCAG 
GTCTCGGCCG CGACGGCGAG CAGTGGGGCA TCCACCGCGA GCCACCGGCA 
TTCGACGAGC TCGAGGGTAA GAC CGAGATC CTGGAGACCG GCGTTAAGGT 
CATCGACCTG CTGACCCCTT ACGTCAAGGG CGGCAAGATT GGCCTCTTCG 
GTGGTGCAGG TGTGGGTAAG ACCGTCCTGA TTCAGGAGAT GATTACCCGT 
ATCGCCCGCG AGTTCTCCGG TACCTCCGTG TTCGCCGGCG TCGGCGAGCG 
TACCCGTGAG GGTACGGACC TCTTCCTCGA GATGGAGGAG ATGGGCGTGC 

^gctggtg ll^l^ J^atgagcc gccgggagtc 



2) INFORMATION FOR SEQ ID NO: 2 76 

(i) SEQUENCE CHARACTERISTICS : 
35 (A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

4 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coxiella burnetii 

(B) STRAIN: Nine Mile phase II 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276 



CCCGTCACGC TGTCCCGAAA GTGTACGATG CTTTGCAGGT GGATGAAAAT * n 

AATTTAACGC TCGAAGTCCA ACAGCAACTC GGGGACGGTG TCGTGCGCAC Ton 

° GGCAGCA CTG AGGGCTTAAA ACGCGATATC SSoSSSS l 50 

ATACGGAAAA ACCGATTGAA GTTCCCGTAG GAAAAGAAAC TTTAGGTCGT , 

ATCATGAACG TGCTGGGTGA GCCGATCGAT GAGTTAGGTC SSSKSS llo 

AAAAGAAAAA CTCCCTATTC ATCGTCCTGC GCCC3AGCTTT Stg2£££ Ill 

cc S? GG CGAATTATTA GAAACCGGTA TTAAAGTGGT CGATTTGCT? \ln 

55 TGCCCCTTTG CTAAGGGAGG CAAAGTGGGT CTTTTTGGAG GCGCGGGCPT Inn 

TGGAAAAACG GTTAATATGA TGGAATTAAT CCGTAATATC TCCATTGAA? tin 

ACAGCGGTTA TTCTGTTTTT GCGGGTGTGG GAGAAAGAA? gSgAAG^C tan 
AATGATTTTT ATCATGAAAT GAAAGAATCC AATGTCTTGG ATAAAGTGGC 
GTTGGTGTAC GGACAAATGA ACGAGCCGCC AGGGAACCGC ttcotSSS 
60 GTTTGACGGG GCTTACGCTG GCGGAAGCCT SSSSJSS SSSKSS 



550 
600 
650 
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PTTCTGTTAT TTATCGATAA TATCTTTCGT TACACTTTGG CAGGGGTTGA 700 

SSScTGOC CTCCTCGGTC GGATGC CATC GGCTGTGGGT TATCAGCCGA 750 

CG??GGC?GA AGaStGGGG GCCCTGCAAG AACGCATTAC TTCCACTAAA 800 
AAAGGGTCCA TTACGTCG 



2) INFORMATION FOR SEQ ID NO: 277 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Edwardsiella hoshinae 

2 0 (B) STRAIN: ATCC 33379 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277 

TTPCCGCAGG ACGCCGTGCC GAAGGTGTAT AACGCACTGG AAGTAAAAGG 50 

i* ™SSScG AAACTGGTAC TGGAAGTGCA GCAGCAGCTG GGTGGCGGCG 100 

5?g??cgc?S ^ATCGCGATG GGCTCTTCCG ACGGTCTGCG CCGTGGGCTA 150 

Sgg??gSg accaagacca TCCGATCGAG GTTCCTGTTG GCAAGGCGAC 200 

???SgG^? ATCATGAACG TACTGGGTGA TCCGGTCGAC ATGAAGGGCG 250 

IS??gStGA AGAAGAGCGT TGGGCTATCC ATCGTGCTGC ACCGAGCTAT 300 

™ ?^ATCTCT CTAACTCTCA GGAACTGCTG GAGACCGGCA TTAAGGTTAT 350 

TGCCCGTTCG CTAAAGGCGG TAAAGTGGGC CTGTTCGGTG 4 00 

SS^OTGTOT GGGTAAGACC GTTAACATGA TGGAGCTTAT CCGTAAC AT C 450 

SSScSgC ACTCCGGTTA CTCAGTCTTC GCCGGTGTGG GTGAGCGTAC 500 

?StS^T AACGACTTCT ACCACGAGAT GACCGATTCC AACGTATTGG 550 

JSSStTTC TCTGGTGTAT GGTCAGATGA ACGAGCCACC GGGAAACCGT 600 

3 c?S?S?g?gg cgSgaccgg tctgaccatg gcggagaaat tccgtgatga 
SSttcotgat gtactgttgt tcatcgataa catctaccgt tataccttgg 

AGTCTCCGCT CTGCTGGGCC GTATGCCGTC GGCGGTAGGT 
SSoSccS Stctggcgga GGAAATGGGG GTGCTGCAAG AGCGTATTAC 

4 0 CTCCACTAAG ACCGGGTCCA TCACCTCTG 



600 
650 
700 
750 
800 
829 



2) INFORMATION FOR SEQ ID NO: 278 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 809 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Edwardsiella tarda 

(B) STRAIN: ATCC 15947 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278 
GCCGTGCCGA AGGTGTATAA CGCACTGGAA GTAAAAGGCG GTGCCACGAA 
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ACTGGTACTG GAAGTGCAGC AGCAGCTGGG TGGCGGCGTC GTTCGCTGCA 100 

TCGCGATGGG CTCCTCCGAC GGTCTGCGCC GTGGGCTGGT GGTTGAAGAC 150 

CAAGACCATC CGATCGAGGT TCCGGTCGGT AAGGCGACCC TGGGCCGTAT 2 00 

CATGAAC GTA CTGGGTGATC CGGTCGACAT GAAGGGCGAG ATCGGCGAAG 2 50 

5 AAGAGCGTTG GGCTATCCAC CGCGCGGCGC CGAGCTATGA AGATCTGTCC 3 00 

AACTCTCAGG AGCTGCTGGA GACCGGCATC AAGGTTATCG ACCTGATTTG 3 SO 

CCCGTTCGCC AAAGGCGGTA AAGTTGGCCT GTTCGGTGGT GCCGGTGTGG 4 00 

GTAAGAC CGT TAACATGATG GAGCTTATCC GTAACATCGC TATCGAGCAC 4 50 

TCCGGTTACT CCGTATTTGC CGGTGTAGGC GAGCGTACCC GTGAGGGTAA 500 

10 CGACTTCTAC CACGAGATGA CCGACTCCAA CGTATTGGAT AAAGTTTCTC 550 

TGGTATACGG CCAGATGAAT GAGCCGCCGG GAAACCGTCT GCGTGTGGCG son 

TTGACCGGTC TGACCATGGC GGAGAAATTC CGTGATGAAG GTCGCGATGT 650 

GTTGTTGTTC ATCGATAACA TTTATCGTTA TACCTTGGCT GGTACCGAAG 70 0 

TTTCTGCTCT GCTGGGTCGT ATGCCGTCGG CGGTAGGTTA TCAGCCGACC 750 

15 CTGGCGGAAG AGATGGGTGT GTTG CAAGAG CGTATCACCT CAACGAAGAC 8 00 
GGGCTCTAT 

809 



2 0 2) INFORMATION FOR SEQ ID NO: 279 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 84 0 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Eikenella corrodens 

(B) STRAIN: ATCC 23834 

^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279 

TTTCCGCGTA CCGCCATTCC GCGTGTTTAC GATGCACTCA AACTGGTTGA 
TACTGATTTG ACGCTGGAAG TACAGCAGCA GCTTGGTGAC GGCGTTGTCC 
GTAC CATTGC GATGGGTAGT ACAGACGGTT TGAAACGTGG CTTGGCTGTG 
CAAAATACTG GTGCACCGAT TACTGTGCCG GTGGGGAAAG CCACCCTAGG 

4 0 CCGCATCATG GACGTTCTGG GCAACCCCGT GGACGAACAA GGTC CGATCG 

GTTCTGACCA AACCCGTGCT ATTCATCAAT TTGCACCTAA GTTCGACGAA 
CTCTCCAGCA CTACCGAATT GTTGGAAACA GGCATTAAAG TGATCGATTT 
GCTTTGTCCG TTTGCTAAAG GTGGTAAAGT GGGTCTGTTT GGCGGTGCCG 
GTGTGGG CAA AACCGTGAAC ATGATGGAGC TGATTAACAA CATTGCCAAA 45 0 

45 GCGCACAGTG GTCTTTCCGT ATTCGCCGGT GTGGGTGAGC GTACTCGTGA 
AGGTAACGAC TTCTACCACG AAATGAAAGA CTCCAACGTG TTGGATAAAG 
TGGCAATGGT GTATGGCCAG ATGAACGAGC CGCCTGGTAA CCGCTTGCGT 
GTTGCTCTAA CTGGTTTGTC GATGGCTGAA TACTTCCGTG ACGAAAAAGA 
CGAAAACGGC AAAGGCCGTG ACGTATTATT CTTTGTGGAT AATATCTAPP 

50 GCTATACTCT GGCGGGTACC GAAGTGTCGG CTCTGCTTGG CCGTATGCCT 
TCTGCTGTGG GTTATCAGCC AACTTTGGCT GAAGAAATGG GTCGTTTGCA 
GGAGCGTATT ACCTCCACCC AAACTGGTTC CATTACCTCT 



55 



2) INFORMATION FOR SEQ ID NO: 280 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 803 bases 
60 (B) TYPE: Nucleic acid 



50 
100 
150 
200 
250 
300 
350 
400 



500 
550 
600 
650 
700 
750 
800 
840 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

fvi) ORIGINAL SOURCE : 

(A) ORGANISM: Enterobacter agglomerans 

(B) STRAIN: ATCC 27989 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280 

rrrrTACCAC GAGTGTACGA TGCACTTGAG GTAAAGAATG GTGAAGAGCG 

„ S s= S EH |S 



CGG 



30 



2) INFORMATION FOR SEQ ID NO: 281 

( i ) SEQUENCE CHARACTERISTICS : 
35 (A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 



60 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter amnigenua 

(B) STRAIN: ATCC 33072 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281 

PGAATTCCCT CAGGATGCCG T AC CGCGCGT GTACGATGCT CTTGAGGTAC 
ap^SSta! CGAGAGTCTG GTGCTGGAAG TTCAGCAGCA GCTCGGTGGT 

£St15?gt£c~ gtaSatcgc catgggttct tccgacggtc tgcgtcgtgg 
?r?S5cTOT? aaagatctcg aacacccgat cgaagtcccg gtaggtaaag 

nlwlcll TCGTATCATG AACGTTTTGG GTCAACCAAT CGACATGAAA 
Sgaagaaga CCGTTGGGCA ATCCACCGTG CAGCACCTTC 
^S™aS Sgtctagct CTCAGGAACT GCTGGAAACC GGCATCAAAG 
SSSXSS SSSSSKS TTCGCTAAGG GCGGTAAAGT TGGTCTGTTC 

ggc^tgIS gcStgggtaa aactgtaaac atgatggagc tgatccgtaa 
StcgStc gagcactccg gttactccgt gtttgcaggc gtgggtgagc 

GTACTCGTGA GGGTAACGAC TTCTACCACG AAATGACCGA TTCCAACGTT 

SSgataaag tatccctggt TTATGGCCAG atgaacgagc caccaggaaa 
cStc^c gttgcgctga ccggtctgac tatggctgag aagttccgtg 
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ACGAAGGTCG TGACGTACTG 
CTGGCCGGTA CTGAAGTATC 
AGGTTACCAG CCGACCCTGG 
TCACTTCTAC CAAAACCGGT 



CTGTTCGTAG ATAACATCTA 
TGCGCTGCTG GGCCGTATGC 
CGGAAGAGAT GGGCGTTCTG 
TCTATCACCT CCG 



CCGTTACACC 700 
CTTCAGCGGT 750 
CAGGAACGTA 800 
833 



2) INFORMATION FOR SEQ ID NO: 282 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



15 



(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Enterobacter asburiae 
20 (B) STRAIN: ATCC 35953 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282 

GCCGTACCAC GCGTGTACGA CGCGCTTGAG GTACAGAATP rT^rr^.P 

25 ? G * GGTGCTG GAAGTTCAGC AGCAGCTCGG 1 nS 

TCGCGATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGtSJJS? J 2° 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGcScS TGGPTPTtit 150 

CATGAACGTA TTGGGTCAAC CAATCGACAT SaSSSSS I^CgIt^C o°,°n 
.n ^ GAGCGTTG GGCTATCCAC CGCGCGGCAC CTTCCTACGA AgS?^ 

3 0 AGCTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATrr fSSnS^ 3 °° 

TCCGTTCGCG AAGGGCGGTA AAGTCGotS ^SSgS? SSgOTG™ Inn 
GTAAAACCGT AAACATGATG GAGCTGATCC GTAACAT?Sc OATOmS? 

^ GGTTACT CCGTGTTTGC GGGCGTAGGT GAACGTACTC SgAgSSJ? inn 

« S^ZZ CTAC CACGAAATGA CCGACTCCAA CGTTCTGGAC S25SSSS 5SS 

3 5 TGGTTTACGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGPP "2 

S^ CGGTC TGAC GATGGC TGAGAAGTTC CGTGATGAAG g22S?5I?S? "2 

TCTGCTGTTC GTTGATAACA TCTATCGTTA CACCCTGGCC oSSSSJS 65 ° 

tatctgcact gctgggtcgt atgccttcag cg§?agg??a ?S5?2?^g ?22 

40 SSgggg? AGATGGGTGT TCTTCAGOAA OOtSSSS ScSSS ™J 



45 



2) INFORMATION FOR SEQ ID NO: 283 

( i } SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 811 bases 

(B) TYPE; Nucleic acid 

(C) STRANDEDNESS: Double 
50 CD) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE • 

55 , (A ! ORGANISM : Enterobacter cancerogenus 

(B) STRAIN: ATCC 35317 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283 
GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTACAGAATG GTAACGAGAG 



60 

300 



810 
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rrTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTACGTACTA 100 

SSISSr TTTTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTAAAAGAC 150 

SSSSE? SSSSSS CCCGGTAGGT AAAGCAACAC TGGGTCGTAT 200 

ttcStCAAC CAATCGACAT GAAAGGCGAC ATCGGTGAAG 250 

* ScS?CCAC CGCGCAGCAC CTTCCTACGA AGAGCTGTCC 300 

5SSSSSg AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 350 

^SgScgS AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400 

SSSSoT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

tcohmSSt SottotSgc GGGCGTGGGT GAACGTACTC GTGAGGGTAA 500 

in CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAT AAAGTATCCC 550 

10 SSSSSSo CCAGATGAAC GAGCCACCAG GAAAC CGTCT GCGCGTTGCG 600 

otS2cGGC? ?SaCCATGGC TGAGAAGTTC CGTGACGAAG GTCGTGACGT 650 

StcSot5c GTTGATAACA TCTACCGTTA CACCCTGGCC GGTACCGAAG 700 

TMCTGCACT GCTOGGTCGT ATGCCTTCAG CGGTAGGTTA TCAGCCTACG 750 

15 SJSSSSS aSSgGTGT TCTTCAGGAA CGTATCACCT CAACCAAAAC 800 
CGGTTCTATC A 



20 2) INFORMATION FOR SEQ ID NO: 284 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 
25 (c) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter cloacae 

(B) STRAIN: ATCC 13047 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284 

35 rrCGTACCAC GCGTGTACGA CGCGCTTGAG GTACAGAATG GTAACGAGAG 50 

cSgSgSg gSaGTtSaGC AGCAGCTCGG CGGCGGTATC GTGCGTACCA 100 

TrGCGATGGG GTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGA.C 150 

SSSSSSc CGATCGAAGT CCCGGTAGGT AAAGCAACAC TGGGTCGTAT 200 

ao SSSaStA TTGGGTCAGC CAATCGACAT GAAAGGCGAC ATCGGTGAAG 250 

^?^r??r GGCTATCCAC CGCGCGGCAC CTTCCTACGA AGAGCTGTCC 300 

iSSSSKl AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

?££SSSS AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 400 

rTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 45C 

^ SSSotTACT CCGTATTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 50C 

JSStTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAC AAAGTATCCC 55C 

SSSSgG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 60 

CTCA™OTC TGACGATGGC TGAGAAGTTC CGTGACGAAG GCCGTGACGT 65 

TCTGCTGTTC GTTGATAACA TCTACCGTTA CACCCTGGCC GGTACCGAAG 70 

50 StcTOCACT GCTOGGTCGT ATGCCTTCAG CGGTAGGTTA TCAGCCTACG 75 

CTG^cSSaG AGATGGGTGT TCTTCAGGAA CGTATCACCT CTACCAAAAC 80< 
CGGTTCTATC ACTTCCG 



2 ) INFORMATION FOR SEQ ID NO: 285 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 766 bases 

(B) TYPE: Nucleic acid 



301 
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(C) S TRANDEDNES S : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi ) ORIGINAL SOURCE : 

(A) ORGANISM: Enterobacter gergoviae 

(B) STRAIN: ATCC 33028 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285 

AATGAGAGCC TGGTGCTGGA AGTTCAGCAG CAGCTCGGCG GCGGTATCGT ™ 
GCGTACCATC GCAATGGGTT CTTCTGACGG TCTGCGTCGC GGTCTGGAAG inn 
TTAAAGATCT CGAACATCCG ATCGAAGTCC CGGTAGGTAA AGCGACCCTC iSo 

15 GGCCGTATTA TGAACGTGCT GGGTCAGCCG GTTGATATGA AAGGCGAtIt 
CGGCGAAGAA GAGCGTTGGG CGATCCACCG CGCTGCGCCG TCCTATGAAG 
AGCTCTCCAG CTCTCAGGAA CTGCTGGAAA CCGGTATCAA ggtaatggac 
CTGATTTGCC CGTTCGCGAA GGGCGGTAAA GTCGGTCTGT TCGGCGGTGC 
GGGCGTTGGT AAAACCGTAA ACATGATGGA GCTGATCCGT AACATCGCGA 

20 TCGAGCACTC CGGCTACTCC GTGTTTGCGG GCGTGGGTGA. ACGTACTCGT tin 
GAGGGTAACG ACTTCTACCA CGAAATGACC GACTCCAACG TTATCGACAA t oo 

AGTATCCCTG GTGTACGGCC AGATGAACGA GCCGCCGGGA AACCGTCTGC lso 
GCGTGGCGCT GACCGGTCTG ACCATGGCTG AGAAATTCCG TCACGAAGGT tlo 
CGTGACGTTC TGCTGTTCGT CGATAACATC TACCGCTATA CCCTctcCGG fiSn 

25 TACTGAAGTA TCCGCACTGC TGGGCCGTAT GCCTTCTGCA GTAGGTTACC 7n2 

SSSSSS SEK*"" atgggtgttc tgcaggaaS SSSSS 7 7 5 °2 

766 



30 



40 



2) INFORMATION FOR SEQ ID NO: 286 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 805 bases 
35 (B) TYPE: Nucleic acid 

(C) S TRANDEDNES S : Double 

(D) TOPOLOGY: Linear 



200 
250 
300 
350 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter hormaechei 

(B) STRAIN: ATCC 49162 

45 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286 

GGGG * ACCAC GC GTGTACGA CGCGCTTGAG GTACAGAATG GTAACGAGAG 
CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GT^SSSS 
*n JSS^H? 00 TTCTT CCGAC GGTCTGCGTC GTGGTCTGGA SSSSSS 
50 CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACGC TGGGTCG?a? 
CATGAACGTA TTGGGTCAGC CAATCGACAT GAAAGGCGAC A?2S5gSg 
AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CTTCCTACGA AoScSJS? 

??SSSS ^^tc aaag5?a?2S aS?S?g?g 

cc IS?*?™ G AAGGGCGGTA aagttggtct gttcggtggt gcgggtgtag 

55 GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAPPap 
TCCGGTTACT CCGTGTTTGC GGGCGTGGGT GAACG?a5?2 

^r cTAC cacgaaatga ccgactccaa cgttctggI? SS?????? 

^™ TACGG CCAG ^TGAAC GAGCCACCAG GAAACCGTCT GCG^GCTGcS 
*n 2^2 TGGCC TGACGATGGC TGAGAAGTTC CGTGACGAAG GCCG?S2S? XX 
60 TCTGCTGTTC GTCGATAACA TCTACCGTTA CACCC?S£2 SctSSSg 7o2 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
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20 



40 



55 



TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA TCAGCCAACG 750 
CTTGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CGAC CAAAAC 8 00 

CGGTT 805 



2) INFORMATION FOR SEQ ID NO: 287 



( i ) SEQUENCE CHARACTERISTICS : 
10 (A) LENGTH: 791 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter sakasakii 

(B) STRAIN: ATCC 29544 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287 



TACGACGCCC TTGAGGTAAC GAATGGTAAT GAGCGTCTGG TGCTGGAAGT 50 

CCAGCAGCAG CTCGGCGGCG GTATCGTACG TACCATCGCG ATGGGCTCTT 100 

2 5 CCGACGGTCT GCGTCGCGGT CTGCCTGTTG CAGACCTTGA GCACCCGATC 150 

GAAGTGCCGG TAGGTAAAGC GACGCTGGGT CGTATCATGA ACGTCCTGGG 2 00 

TCAGCCTATC GACATGAAAG GCGACATCGG CGAAGAAGAG CGTTGGGCGA 2 50 

TTCATCGCGC GGCGCCGTCC TATGAAGAGC TGTCCAGCTC TCAGGAACTG 3 00 

CTGGAAACCG GCATCAAAGT TATCGACCTG ATGTGTCCGT TCGCGAAGGG 350 

30 CGGTAAAGTC GGTCTGTTCG GTGGTGCAGG TGTAGGTAAA ACCGTAAACA 4 00 

TGATGGAGCT TATTCGTAAC ATCGCGATTG AGCACTCCGG TTACTCCGTG 450 

TTTGCGGGCG TGGGCGAACG TACCCGTGAA GGTAACGACT TCTACCACGA 500 

AATGACCGAC TCCAACGTAC TGGATAAAGT ATCCCTGGTG TACGGCCAGA 550 

TGAACGAGCC GCCGGGAAAC CGTCTGCGCG TTGCGCTGAC CGGCCTGACC 6 00 

3 5 ATGGCTGAGA AATTCCGTGA CGAAGGTCGT GACGTTCTGC TGTTCGTCGA 650 

CAACATCTAC CGTTACACCC TGGCCGGTAC TGAAGTATCC GCACTGCTGG 7 00 

GCCGTATGCC TTCAGCGGTA GGTTATCAGC CGACCCTGGC GGAAGAGATG 750 

GGTGTTCTGC AGGAGCGTAT CACCTCCACC AAAACCGGTT C 7 91 



2) INFORMATION FOR SEQ ID NO: 288 



(i) SEQUENCE CHARACTERISTICS: 

4 5 (A) LENGTH: 83 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus avium 

(B) STRAIN: ATCC 14025 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288 



TTTCTTTAGA TCAATCCTTA CCAGACATCA ACAATGCGTT GATTGTTTAC 50 

AAAAAAGATA AAACAAAAGT TGTTCTTGAA GTTGCTTTGG AACTTGGTGA 100 

60 TGGTGTTATC CGCACAATCG CTATGGAGGC TACTGATGGA TTGCAACGTG 150 
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GAATGGAAGT TGTCGATACT GGCAAATCAA 
GATACTCTAG GTCGTGTGTT TAACGTATTA 
AGCACCTTTT CCAGAAGATG CAGAAAGAAG 
CTGCTTTTGA AGACCTTAGT ACAAGTAACG 
5 AAGGTTATCG ACTTATTAGC CCCTTATTTA 
ATTCGGTGGT GCCGGTGTTG GTAAAACCGT 
ATAATATCGC CCAAGAACAC GGTGGTATTT 
GAACGTACTC GTGAAGGGAA CGACCTTTAT 
CGTTATTGAG AAAACAGCCA TGGTGTTCGG 
10 GTGCACGTAT GCGTGTTGCC TTGACTGGTT 
CGTGATGAAG AAGGACAAGA TGTGTTGCTA 
CTTTACTCAA GCCGGATCAG AAGTTTCTGC 
CAGCCGTTGG GTATCAACCA ACTTTGGCAA 
GAACGAATCA CTTCAACCAA AAAAGGTTCG 

15 



TCTCCGTTCC TGTAGGTAAA 2 00 

GGTGAAACGA TTGATAAAGA 250 

CGGCATTCAT AAAAAGGCGC 3 00 

AGATTTTGGA AACAGGGATC 3 50 

AAAGGTGGGA AAGTCGGACT 400 

TTTGATCCAA GAATTAATTC 450 

CAGTGTTTAC CGGTGTTGGG 500 

TATGAAATGA AAGACTCTGG 550 

ACAAATGAAC GAGCCGCCTG 60 0 

TGACATTAGC TGAATATTTC 650 

TTTATTGACA ACATCTTCCG 70 0 

CTTATTAGGA CGTATGC CAT 750 

CTGAAATGGG GCAATTACAA 80 0 

ATCACTTCA 83 9 



2) INFORMATION FOR SEQ ID NO: 2 89 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 847 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 
30 (B) STRAIN: ATCC 25788 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289 

TTCTCTCTAG ACCAATCATT AC CAGATATC AACAATGCGT TGATTGTTTA 5 0 

35 CAAAAAAGAT GAGCAGAAAA CAAAAGTTGT GTTAGAAGCT GCCTTAGAAC 100 

TTGGCGACGG CGTTATCCGT ACGATTGC CA TGGAATCAAC AGATGGCTTA 15 0 

CAACGAGGAA TGGAAGTAAT CGATACAGGC GCCTCCATTT CTGTTCCAGT 200 

TGGGACAGAA ACCTTAGGAC GGGTGTTTAA TGTCTTAGGG GACACCATCG 250 

ATTTAGAAGC GCCGTTCCCT GAAGAAGCAC CCCGCAGTGG GATTCACAAA 3 00 

4 0 AAAGCACCTG ACTTTGATGA ATTGTCAACA AGTACGGAGA TCCTTGAAAC 350 

TGGGATCAAA GTTATCGATT TGTTAGCCCC TTATTTAAAA GGGGGGAAAG 4 00 

TTGGACTTTT CGGTGGTGCC GGTGTTGGTA AAACCGTCTT GAT CCAAGAA 450 

TTGATCCACA ACATCGCCCA AGAGCATGGT GGGATCTCTG TCTTCACAGG 50 0 

TGTTGGTGAA CGGACACGTG AAGGAAATGA CCTTTATAAT GAAATGAAAG 550 

45 AATCTGGCGT TATCGAAAAA ACAGCCATGG TGTTTGGACA AATGAACGAA 60 0 

CCACCAGGTG CTCGGATGCG GGTAGCCTTG ACTGGTTTGA CATTAGCCGA 65 0 

GTACTTCCGT GATGTGGAAG GACAAGACGT GCTCTTGTTT ATCGATAATA 700 

TCTTCCGCTT CACTCAAGCA GGTTCTGAAG TATCTGCCTT ACTAGGTCGG 750 

ATGCCGTCTG CCGTTGGGTA TCAGC CAACA TTAGCAACTG AGATGGGGCA a0 0 

50 ATTACAAGAA CGGATCACAT CGACGAAGAA AGGTTCCGTT ACGTCTA 847 



2) INFORMATION FOR SEQ ID NO: 2 90 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus durans 

(B) STRAIN: ATCC 19432 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290 



in TTTCTTTAGA CCAATC CTTA CCAGATATCA ACAACGCTTT AGT TGTTTAT 50 
10 SSSSro AGAAGAAATC GAAAGTTGTT CTTGAAACAG CGCTAGAATT 100 

200 
250 
300 

15 TT ™ft„r? ^r™^™^ ™^.TAraa r.TAPTGAGAT CCTTGAAACT 350 

400 
450 
500 



JSSJSiSSS GTCATCCGTA GGAATCAACA GATGGTTTAC 

AACGCGGAAT GGAAGTCATT GATACAGAAA AAGCAATTTC TGTACCAGTG 
PGTAAAGAAA CGTTAGGTCG TGTATTCAAT GTATTAGGAG ATACGATCGA 
KjScTGCA CCTTTCCCAG AAGATGCAAA ACGTAGCGAA ATCCATAAAA 



AAGCACCAAA CTTTGATGAG TTAAGTACAA GTAC TGAGAT CCTTGAAACT 
SSSatcSag TTATTGACTT GCTTGCTCCT TACTTAAAAG gtgggaaagt 
^?S£tc GGTGGTGCCG GTGTAGGTAA AACTGTATTG ATCCAAGAAT 450 
?SJ??ATAA TATCGCTCAA GAACACGGTG GTATTTCTGT ATTTACTGGT 500 
9 0 GTOGTGAAC GTACACGTGA AGGTAATGAC CTTTATTATG AAATGAAAGA 550 
SSSaGTT ATTGAAAAAA CAGCCATGGT GTTTGGTCAA ATGAACGAAC 600 
CACCAGGTGC ACGTATGCGT GTTGCCTTGA CTGGTTTGAC GATTGCTGAA 
TACTTCCGTG ATGTTGAAGG GCAAGACGTG CTATTGTTTA TTGATAATAT 
TTTCCGTTTC ACTCAAGCCG GTT CAGAAGT TTCTGCCCTA TTAGGTCGTA 
25 TGCCTTCTGC CGTTGGGTAC CAACCAACGC TAGCAACAGA AATGGGTCAA 800 
TTACAAGAAC GGATCACTTC AACGAAAAAA GGTT CAATCA CTTCA 845 



650 
700 
750 



3 0 2) INFORMATION FOR SEQ ID NO: 2 91 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 0 bases 

(B) TYPE: Nucleic acid 
35 (c) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: ATCC 29212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291 
4 5 



50 



55 



60 GTTTCACCCA AGCCGGTTCA BAAWrnvw 

305 



TTTATAAAAA 


50 


GAACTAGGTG 


100 


TTTACAACGT 


150 


CTGTTGGTAA 


200 


ATTGACTTAG 


250 


TAAAAAAGCG 


300 


AAACAGGGAT 


350 


AAAGT CGGAC 


400 


AGAATTAATT 


450 


CTGGTGTTGG 


500 


AAAGATTCAG 


550 


CGAACCGCCA 


600 


CTGAATATTT 


650 


AACATTTTC C 


700 


TCGGATGCCG 


750 
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TCAGCCGTTG GTTACCAACC AACCTTAGCG ACTGAAATGG GACAATTACA 800 
AGAACGGATT ACTTCAACGA AAAAAGGATC AATTACCTCT 84 0 



2) INFORMATION FOR SEQ ID NO: 292 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 831 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: En teroco ecus faecium 

(B) STRAIN: ATCC 19434 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292 

TTTCTTTAGA CCAATCCTTA CCAGATATCA ACAACGCATT AGTTGTTTAT 5 0 

AAAAATGACG AAAATAAATC AAAAGTTGTT CTTGAAGCAG CGTTAGAATT 100 

AGGGGACGGA GTGATCCGGA CCATTGCGAT GGAATCAACA GATGGTTTAC 150 

25 AAAGAGGAAT GGAAGTCATT GATACAGGCA AAGCAATCTC TGTTCCTGTA 200 

GGTAAAGAAA CATTAGGTCG CGTATTCAAC GTAC TAGGAG ATACGATCGA 250 

TTTAGAAACA CCTTTCCCAG AAGATGCGGA AAGAAGCGAA ATTCATAAAA 3 00 

AAGCACCAGC CTTTGACGAA TTAAGTACAA GTACAGAAAT TTTGGAAACA 350 

GGGATCAAAG TTATCGATTT GCTTGCCCCA TATTTAAAAG GTGGGAAAGT 400 

3 0 CGGACTATTC GGTGGTGCCG GTGTTGGTAA AACCGTACTG ATCCAAGAAC 4 50 

TGATC CATAA TATCGCCCAA GAACATGGTG GTATTTCTGT ATTTACCGGT 500 

GTAGGTGAAC GTACTCGTGA AGGTAATGAC TTGTATTATG AAATGAAAGA 550 

TTCAGGAGTT ATCGAAAAAA CAGCCATGGT GTTCGGACAA ATGAACGAAC 600 

CACCAGGTGC ACGTATGCGT GTTGCTTTGA CTGGTTTGAC GATTGCGGAA 650 

3 5 TATTTCCGTG ATGTAGAAGG TCAAGATGTA CTGTTGTTTA TCGACAACAT 700 

TTTCCGTTTC ACTCAAGCTG GATCTGAAGT ATCAGCCTTG TTAGGACGGA 75 0 
TGCCTTCTGC GGTTGGTTAT CAACCAACAT TGGCAACAGA AATGGGTCAA 
TTGCAAGAAC GTATCACATC TACGAAAAAA G 



800 
831 



40 

2 ) INFORMATION FOR SEQ ID NO: 293 

( i ) SEQUENCE CHARACTER I STI CS : 
45 (A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: ATCC 49573 

55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293 

CCTTACCAGA TATCAACAAT GCGTTGATCG TTTACAAAAA AGACGAGAAA 
AAAACAAAAG TAGTATTGGA AGCCGCTTTA GAACTAGGGG ATGGTGTGAT inn 
60 CCGCACCATC GCAATGGAAT CTACAGACGG TTTGCAACGA GGa2tg5SS 

306 



50 
00 
150 
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TCATCGATAC CGGTGCCTCA ATCTCTGTCC 
GGCCGAGTAT TTAATGTACT AGGCGATACT 
CCCAGAAGAT GCCAAACGTA GTGGCATCGA 
ATGAATTGTC AACAAGTACA GAAATCCTTG 
5 GATTTAT TAG CTCCTTACTT AAAAGGTGGT 
TGCCGGTGTT GGTAAAACCG TATTGATTCA 
CTCAAGAGCA TGGGGGAATT TCAGTATTTA 
CGTGAAGGTA ATGACTTGTA TTATGAAATG 
AAAGACAGCC ATGGTTTTCG GTCAAATGAA 
10 TGCGGGTTGC TTTGACTGGT TTGACCATTG 
GAAGGACAAG ATGTGCTCTT GTTTATCGAT 
AGCGGGTTCT GAAGTATCTG CCTTGTTAGG 
GTTATCAACC AACTCTAGCA AC TGAAATGG 
ACTTCTACGA AAAAAGGATC TGTAAC 

15 



CTGTAGGAAC AGATACTCTA 200 

ATCGACTTGG AAGCACCATT 250 

CAAAAAAGCC CCAGATTTCG 3 00 

AAACTGGGAT CAAAGTTATC 350 

AAAGTCGGCT TGTTCGGTGG 4 00 

AGAATTGATT CACAATATCG 45 0 

CCGGTGTTGG CGAACGGACG 500 

AAAGAATCAG GCGTTATCGA 550 

TGAACCACCA GGTGCCCGGA 600 

CTGAGTATTT CCGTGACGTT 650 

AATATTTTCC GTTTCACACA 70 0 

CCGGATGCCA TCAGCCGTTG 750 

GTCAATTACA AGAACGAATC 8 00 

826 



2) INFORMATION FOR SEQ ID NO: 2 94 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus saccharolyticus 
30 (B) STRAIN: ATCC 43076 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 94 

TTTCTTTGGA CCAATCCTTA CCAGACATCA ACAATGCGTT GGTGGTCTAT 5 0 

35 AAAAAGAATG ATGAAAAAAC AAAAGTGGTA CTTGAAACAG CTTTAGAACT 10 0 

TGGTGATGGT GTCGTACGTA CGATTGCTAT GTCGTCAACA GACGGTTTGC 150 

AACGTGGGAT GGAAGTCATC GATACAGGAG CAT CAATTTC TGTTCCTGTT 200 

GGGAAAGAGA CATTAGGACG TGTATTTAAC GTTTTAGGGG AGACTATCGA 250 

CTTAGATGGT CCATTCCCAG AAGAAGTAGC ACGAGATGGT ATTCATAAAA 300 

4 0 AGGCACCTGA TTTTGATGAA TTAAGTACAA GTACGGAGAT TCTTGAAACA 350 

GGGATTAAAG TAATCGATTT ATTAGCGCCT TACTTAAAAG GTGGGAAAGT 4 00 

TGGTTTATTC GGTGGTGCCG GTGTAGGTAA AACGGTATTA ATTCAAGAAT 4 50 

TGATTAACAA TATTGCGCAA GAACATGGTG GTATTTCAGT ATTTGCGGGT 500 

GTTGGTGAGC GTACTCGTGA AGGAAATGAC CTTTATTATG AAATGAAAGA 55 0 

45 GTCGGGCGTT ATTGAGAAAA CAG CGATGGT TTTTGGACAA ATGAACGAAC 600 

CACCAGGTGC ACGTATGCGA GTTGCTTTAA CTGGTTTAAC CATTGCAGAA 650 

TACTTCCGTG ATGTTGAAGG ACAAGATGTA TTACTATTTA TTGATAACAT 7 00 

TTTCCGTTTT ACTCAAGCTG GTTCAGAAGT TTCAGCTTTA TTAGGACGTA 750 

TGCCTTCAGC GGTAGGGTAT CAACCGACAT TAGCAACAGA AATGGGACAA 800 

50 TTACAAGAAC GTATTACGTC AACGAAAAAA GGCTCAATTA CATCAA 846 



55 



2) INFORMATION FOR SEQ ID NO: 2 95 



( i ) SEQUENCE CHARACTER I STI CS : 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 
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CA 02283458 1999-09-28 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Escherichia fergusonii 

(B) STRAIN: ATCC 35469 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295 

10 GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTGCAAAATG GTAATGAGCG 50 

TCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGTGGTATC GTGCGTACCA 100 

TCGCAATGGG GTCCTCCGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAC 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCGACTC TGGGCCGTAT 200 

CATGAACGTA CTGGGTGAAC CGGTCGACAT GAAAGGCGAG ATCGGTGAAG 250 

15 AAGAGC GTTG GGCGATTCAC CGCGCAGCAC CTTCCTACGA AGAGCTGTCA 3 00 

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCT AAGGGCGGTA AAGTCGGTCT GTT CGGTGGT GCGGGTGTAG 400 

GTAAAACTGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 4 50 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

2 0 CGACTTCTAC CACGAAATGA CTGACTCCAA CGTTATCGAC AAAGTATCCC 550 

TGGTATATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 600 

CTGACCGGCC TGAC CATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

TCTGCTGTTC GTTGACAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 70 0 

TATCCGCACT GCTGGGCCGT ATGCCTTCAG CGGTAGGTTA TCAGCCGACT 750 

25 CTGGCGGAAG AGATGGGCGT TCTTCAGGAA CGTATCACCT CCACCAAAAC 800 

TGG ~ 8 03 



3 0 2) INFORMATION FOR SEQ ID NO: 2 96 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia hermarmii 

(B) STRAIN: ATCC 33650 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296 



GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTGCAAA ATGGTGATGA 50 

GCGTCTGGTG CTGGAAGTGC AGCAG CAGCT CGGCGGCGGT AT CGTGCGTA 100 

CCATCGCAAT GGGTTCTTCC GACGGTCTGC GTCGTGGTCT GACTGTCGTC 15 0 

GACCTCGAGC ACCCGATCGA AGTCCCGGTA GGTAAAGCGA CCCTGGGCCG 2 00 

50 TATCATGAAC GTGCTGGGTC AG C CGATCGA CATGAAAGGC GATATCGGTG 250 

AAGAAGAGCG TTGGGCGATT CACCGCGCGG CGCCGTCCTA TGAAGAGCTG 3 00 

TCCAGCTCTC AGGAACTGCT GGAAACCGGC ATCAAAGTTA TCGAC CTGAT 350 

GTGTCCGTTC GCGAAGGGCG GTAAAGT CGG TCTGTTCGGT GGTGCGGGCG 4 00 

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 45 0 

55 CACTCCGGTT ACTCTGTGTT TGCGGGCGTG GGTGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCATGAAA TGACCGACTC CAACGTTCTG GACAAAGTAT 550 

CCCTGGTTTA CGGCCAGATG AACGAACCGC CGGGAAACCG TCTGCGCGTT 60 0 

GCACTGACCG GCCTGACCAT GGCTGAGAAA TTCCGTGACG AAGGT CGTGA 650 

*n n™^™ TG TTCGT CGACA ACATCTACCG TTACACCCTG GCCGGTACTG 700 

60 AAGTATCCGC ACTGCTGGGC CGTATGCCTT CTGCGGTAGG TTACCAGCCG 750 

308 
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r * t » 

ACCCTGGCGG AAGAGATGGG CGTTCTGCAG GAGCGTATCA CCTCCACCAA 
AACCGGTTCT ATCACCTCCG TA 



2) INFORMATION FOR SEQ ID NO: 297 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 808 bases 
10 (b) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



25 



30 



35 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298 



800 
822 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia, vulnens 

(B) STRAIN: ATCC 33821 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297 

CCGAACGTGT ACGACGCCCT CGAAGTGACA AATGGTAATG AG CGT CTGGT 50 

GCTGGAAGTT GAG CAGCAGC TCGGCGGCGG TATCGTACGT ACCATCGCTA 100 

?§GOT^ic CGACGGTCTG CGTCGTGGTC TGGAAGTTCA GGACCTCGAG 150 

S?2?5atcg AAGTGCCGGT AGGTAAAGCG ACCCTGGGTC GTATCATGAA 200 

StACTGGGT CAGCCGATCG ATATGAAAGG CGACATCGGT GAAGAAGAGC 250 

GTTGGGCTAT TCACCGTGCA GCACCGTCCT ATGAAGAGCT CTCCAGCTCT 300 

SgGAACTGC TGGAAACCGG CATCAAGGTT ATCGACCTGA TGTGTCCGTT 350 

Sc?AAGGGC GGTAAAGTCG GCCTGTTCGG CGGCGCGGGC GTGGGTAAAA 400 

c2S5^ACAT GATGGAGCTG ATCCGTAACA TCGCGATCGA GCACTCCGGT 450 

SaCTCCGTGT TTGCAGGCGT GGGTGAGCGT ACTCGTGAGG GTAACGACTT 500 

SScACGAG ATGACCGACT CCAACGTTCT GGACAAAGTA TCCCTGGTGT 550 

ACGGCCAGAT GAACGAGCCG CCGGGAAACC GTCTGCGCGT GGCACTGACC 600 

S^TGACCA TGGCTGAGAA GTTCCGTGAC GAAGGTCGTG ACGTTCTGCT 650 

GTTCGTTGAC AACATCTATC GTTACACCCT GGCCGGTACG GAAGTATCTG 700 

SSgSgGG CCGTATGCCT TCAGCGGTAG GTTACCAGCC GACGCTGGCG 750 

GAAGAGATGG GCGTTCTGCA GGAGCGTATC ACCTCCACCA AAACCGGTTC 800 
TATCACCT 

40 

2 ) INFORMATION FOR SEQ ID NO: 298 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 843 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Eubacterium lentum 

(B) STRAIN: ATCC 43 055 



TTTCCCCCTG ATCAGCTGCC GGCGATTTAC AACGCGCTGA CGGTTGATGC 50 
CAAGACCCTG GCGGGCGACT TGCACCTCGT GCTC GAGGTC GAGACGCACC 100 
60 TGCCGGGCAA CCTTGTCCGC TCGGTGGCCA TGAGCTCGAC GGACGGTCTC 150 
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GTCCGCGGCC TCGAGGTCGT CGACACGGGC 
GGGTCCCGAG ACCCTGGGTC GCATCTGGAA 
ACGAGAAGCC GATGCCCGAG GTGAAGGGCT 
GCTCCGGACT ACGACGAGCT GTCCACCACC 
5 CATCAAGGCC ATCGACCTCG TCGAGCCCTT 
GTCTGTTCGG CGGCGCCGGC GTGGGCAAGA 
ATCAACAACC TGGCCCAGGA GCACGGCGGC 
GGGCGAGCGT ACCCGCGAGG GTACCGACCT 
CGGGCGTCAT CAACAAGACC TGCCTCGTGT 
10 CCGGGAGCGC GTCTGCGCGT GGGTCTCGCG 
CTTCCGCGAT CAGGGCCAGG ACGTGCTTCT 
GCTTCACGCA GGCCGGCTCC GAGGTGTCCG 
TCTGCCGTGG GTTACCAGCC GACGCTGGCA 
GGAGCGCATC ACGTCGACGT CCACCGGCTC 



AACCCGATCA TGATGCCCGT 200 

CGTCATGGGC GAGCCCGTCG 250 

ACATGCCCAT CCACCGTCCG 3 00 

ACCGAGATCT TCGAGACCGG 350 

CGTCAAGGGC GGCAAGACGG 400 

CGGTTATCAT CCAGGAGCTC 450 

ACGTCGGTGT TCACGGGCGT 500 

CTACCTGGAG ATGAGC GACT 550 

ACGGTCAGAT GAACGAG CCT 600 

GGCCTCACCG AGGCGGAGTA 65 0 

GTTCGTGGAC AACATCTTC C 70 0 

CTCTGCTGGG CCGCATGCCC 75 0 

AC CGAGATGG GCGACCTGCA 80 0 

CATCACGTCC GTG 843 



2) INFORMATION FOR SEQ ID NO: 2 99 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ewingella americana 
30 (B) STRAIN: ATCC 33 852 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299 

TCCCTCAGGA TGCAGTACCG AACGTGTACA ATGCTCTTGA GGTAGAAAAC c n 
35 JESSES ^TGGTTCT GGAAGTTCAG CAACAGTTaS SSSSSSSS loS 

150 
200 

AATCGGCGAA GA&GAACGTC GTGCAATTCA CCGTCCAGCG CCTTCTTATG III 

4 J SSSSSS 2KSK2* =£™2S «?ccggtat SSSSSSS Itl 

400 
450 
500 
550 
600 
650 
700 



TGTTCGTTGT ATCGCAATGG GTACCTCAGA CGGCCTTCGT CGCGGTCTGA 
AAGTGAACAA CCTGGAACAC CCAATTGAAG TTCCGGTTGG TAAAGCGACT 

ATTGGGTGAA CCAATCGACA TGAAAGGTGA 



^CTGATGT GTCCGT TCGC TAAGGGCGGT AAAGTCGGTC TGTTCGGTGG 
T GCGGGTGTT GGTAAAACTG TAAACATGAT GGAGCTGATC CGtScATCG 
£ GA I CGAGCA CTCCGGTTAC TCAGTGTTTG CAGGCGTGGG TGAGCGTACT 
CGTGAGGGTA ACGACTTCTA CCACGAAATG ACTGACTCCA ACGTTATCgI 
45 CAAAGTTTCC CTGGTCTATG GTCAGATGAA TGAGCCACCA 

TGCGCGTTGC ACTGACCGGC CTGACCATGG CGGAGAAATT CCGTGATGAA 
GG ^GTGACG TACTGCTGTT CGTTGACAAC ATTTACCGTT ACACCcJgSS 
A ^ A S GGAA GTGTCCGCAC TTCTGGGCCG TATGCCATCG GCGOTAGG?? 
so SJSSS^ GCTGGCGG AA GAGATGGGTG CTCTGCAAGA GCGTAtSS 800 
50 TCTACCAAAA GTGGTTCTAT CACCTCCGT ^iAiuin. 800 

82 9 



2) INFORMATION FOR SEQ ID NO: 3 00 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Francisella tularensis 

(B) STRAIN: LVS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300 



10 AACACGCCTA 
ATTAGAAGTT 
TGGGATCTAG 
GCGCCTATTT 
TGTTTTAGGT 

15 AAAGATCTAT 
ACAGAAATAT 
TGC TAAGGGC 
CTGTAACGAT 
TACTCTGTAT 

2 0 CTACTATGAG 
ATGGTCAGAT 
GGCTTAACAA 
GTTTATCGAT 
CGCTACTTGG 

25 GCTGAGATGG 
TATTA 



AAGTATATGA 
CAGCAACAAA 
TGATGGTCTT 
CTGTTCCAGT 
GAACCAATTG 
CCATCAAGCT 
TAGAAACAGG 
GGTAAAGTTG 
GATGGAACTT 
TTTCCGGTGT 
ATGAAATATT 
GAATGAGCCG 
TAGCAGAAGG 
AACATCTATC 
TCGTATGCCA 
GTGC TTTACA 



TGCTTTAAAT 
TTGGTGATGG 
AGACGTGGTA 
TGGACATGGC 
ATGAAGCTGG 
CCTCCTGCAT 
TAT CAAAGTA 
GTTTATTTGG 
ATCAACAATA 
TGGTGAAAGA 
CTAATGTATT 
CCTGGAAACA 
ATTCCGTGAT 
GTTATACATT 
TCTGCTGTGG 
GGAGCGTATT 



GTAGTAGAAG 
CGTAGTTCGT 
TGGAAGTTAA 
ACACTTGGAC 
TCCAATTGAA 
ATGATGAGTT 
GTTGACCTTA 
CGGTGCAGGT 
TTGCAAAAGA 
ACTCGTGAAG 
GGATAAAGTA 
GATTAAGAGT 
GAAAAGCGTG 
AGCAGGTACA 
GTTATCAGCC 
ACATCTACTA 



CTGGTTTAGT 50 

ACAATTGCTA 100 

GAACACAAAT 150 

GTATCATGAA 2 00 

TATACTGAGA 2 50 

AGCATTAAGT 3 00 

TTTGTCCATT 3 50 

GTTGGTAAAA 4 00 

ACATAGTGG C 450 

GTAATGACTT 500 

TCATTAGTAT 55 0 

AGCTCTTAGT 60 0 

ATGTTTTGAT 650 

GAGGTATCGG 700 

AACGCTTGCA 750 

AGACAGGATC 8 00 
805 



30 2) INFORMATION FOR SEQ ID NO: 301 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 0 (vi) ORIGINAL SOURCE : 

(A) ORGANISM: Fusobacterlum gonidiaformans 

(B) STRAIN: ATCC 25563 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301 



GACGAATTGC CAAAAATATA CAATGCATTA AAGGTGCAAG TTGGAGAAAA 50 

AGAACTTGTA TTGGAAGTGC AACAACATTT GGGAAATAAT GTTGTGAGAA 100 

CAGTAGCGAT GGACTCAACA GATGGATTGC TTCGAGGAAT GGAAGTAATG 15 0 

GATACCGGAG CACCGATTAC TGTTCCAGTA GGGAAGGCGG TTTTAGGAAG 200 

50 AATATTGAAT GTTTTGGGAG AGCCTGTGGA TCAAAAAGGG CCTGTGGAAA 250 

CAGAAGAATA TTTACCTATC CATAGAGAAG CACCAAAATT TGAAGAACAA 3 00 

GAAACAGTAA CAGAAATTTT TGAAACAGGA ATTAAAGTCA TAGATTTGTT 3 50 

AGCCCCTTAT ATCAAAGGAG GAAAGACAGG TCTATTCGGT GGAG CCGGAG 40 0 

TAGGGAAAAC AGTTTTAATT ATGGAATTAA TTAATAACAT TGCAAAGGGC 45 0 

55 CACGGAGGAA TTTCTGTGTT TGCAGGAGTT GGAGAAAGAA CAAGAGAAGG 500 

AAGAGATTTA TACAACGAAA TGACAGAGTC CGGAGTTTTG AATAAGACCT 550 

CGTTGGTGTA TGGTCAAATG AATGAGCCGC CCGGAGCAAG ACTTCGTGTG 600 

GCGTTGACAG GATTAACGGT TGCTGAAAAC TTTAGAGATA AAGAAGGGCA 65 0 

AGATGTATTG TTGTTTATCG ACAATATCTT CCGTTTCACA CAAGCAGGAT 700 

60 CAGAAGTATC GGCTCTATTG GGAAGAATTC CATCGGCAGT AGGATATCAA 750 
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CCGAACTTAG CGACAGAAAT GGGAACTTTA CAAGAAAGAA TTACTTCTAC 8 Of! 

AAAATCAGGA TCTATCACTT CGGTA 



2) INFORMATION FOR SEQ ID NO: 3 02 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusobacterium necrophorum subsp. nec.ropho.runi 

(B) STRAIN: ATCC 25286 ^ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302 



ACAATGCATT AAAGGTACAG GTGGGAGAAA GGGAACTTGT GTTGGAAGTG 
CAGCAACATT TAGGAAATAA TGTTGTCAGA ACAGTAGCAA TGGATTCAAC 
AGACGGATTA CTTCGGGGAA TGGAAGTGAG AGATACAGGA GTTCCCATTA 
CTGTTCCGGT AGGAAAGGCG GTTTTGGGAA GAATATTAAA TGTCTTAGGG 
GAGCCTGTGG ACGAAAAAGG TCCGATAGAG ACAGAAGAAT ATTTACCAAT 
ACATAGAGAA GCACCGAAAT TTGAAGAACA GGAAACGGTG ACAGAAATTT 
TTGAAACAGG AATTAAAGTC ATTGATTTGT TAGCTCCTTA TATTAAAGGA 
GGAAAAACAG GCCTATTCGG AGGAGCCGGA GTAGGAAAAA CCGTTTTGAT 
TATGGAACTG ATCAATAATA TTGCAAAAGG TCATGGAGGA ATTTCTGTTT 
TTGCAGGAGT TGGAGAAAGA ACGAGAGAGG GAAGAGATCT ATACAACGAA son 
ATGACAGAGT CCGGAGTTTT GAATAAAACT TCTTTGGTAT ATGGGCAAAT Izn 
GAATGAGCCG CCCGGAGCAA GACTTCGAGT GGCTTTAACC GGACTTACTG 600 
TTGCCGAAAA TTTCAGAGAT AAAGAGGGAC AGGATGTCTT ATTGTTCATT Vkcs 
GACAATATTT TCCGTTTCAC ACAAGCAGGT TCGGAAGTAT CGGCACTTTT 
GGGGAGAATT CCTTCTGCAG TGGGATATCA ACC GAACTTG GCGACAGAAA 
TGGGAAGCTT ACAAGAAAGA ATTACTTCTA CAAAATCCGG TTCTATCACT 
TCCGTG 



50 
100 
150 
200 
250 
300 
350 
400 
450 



700 
750 
800 
806 



2) INFORMATION FOR SEQ ID NO: 303 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusobacterium nucleatum subsp. polymorphism 

(B) STRAIN: ATCC 10953 ^ P 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303 

GATGAATTGC CTGCAATATA TAATGCTTTA AAAGTAAAAT TAGAAGATAA 
^™ TTGTT CTAQ AAGTTG AACAACATCT TGGTAACAAT GTTGtSg^ 
CTGTTGCTAT GGATTCAACT GATGGATTAA AAAGAGGAAT GGAAGT^aJJ 



50 
100 
150 
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GATACAGGTA AACCAATTAC AGTACCAGTT 
AATATTAAAT GTTTTAGGAG AACCTGTTGA 
CTGAAACATT TTTACCTATT GATAGAGAAG 
GAAACTGAAA CTGAAATATT TGAAACAGGA 
5 AGCACCATAT ATTAAAGGTG GAAAAATAGG 
TAGGAAAAAC AGTTTTAATA ATGGAACTTA 
CATGGAGGAA TTTCAGTTTT TGCAGGAGTT 
TAGAGACTTA TATGGTGAAA TGACTGAATC 
CTCTTGTTTA TGGACAAATG AATGAGCCAC 
10 GCATTAACAG GGCTTACTGT TGCAGAAAAC 
AGATGTTCTT CTATTTATAG ATAATATATT 
CAGAAGTTTC AGCTTTACTT GGAAGAATAC 
CCAAACCTAG CAACTGAAAT GGGTGCTTTA 
AAAATCTGGT TCAATTACAT C 

15 



GGTAAAGCTG TTCTTGGTAG 2 00 

TAATCAAGGT CCTATAAATG 250 

CACCAGAATT TGATGACTTA 3 00 

ATAAAAGTTA TAGACTTATT 3 50 

ATTATTTGGT GGAGCTGGAG 400 

TCAACAACAT TGCAAAAGGA 4 50 

GGAGAAAGAA CAAGAGAAGG 5 00 

AGGAGTTATC ACAAAAACAG 550 

CTGGAGCAAG AC TTAGAGTT 600 

TTTAGAGATA AAGATGGGCA 650 

TAGATTTACA CAAGCAGGTT 700 

CATCAGCTGT TGGATATCAA 75 0 

CAAGAAAGAA TAACATCTAC 800 

821 



2) INFORMATION FOR SEQ ID NO: 3 04 

2 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 864 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gardnerella vaginalis 
3 0 (B) STRAIN: ATCC 4 914 5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304 

TTCCCAGTTG GCTATCTTCC AGATATTTAT AATGCTCTCA AGGTTGATAT 5 0 

3 5 CAACACCGTT GGAAACACGG AGGGAGATAC CGTCCACGAG ATTACATTGG 100 

AAGTTGAGCA GCACCTTGGT GATTCAACTG TGCGAGCAGT GGCACTTAAG 150 

CCTACGGACG GCTTGGTCCG TGGTGCTTTA GTGCGAGATA CTGGTGGCCC 2 00 

AATTTCTGTG CCTGTTGGAG ATGTTACAAA AGGTCACGTT TTTGACGTAA 2 50 

CTGGTAACAT TTTAAACGCT AAACCAGGCG AAAACATTGA GGTGAC CGAG 3 00 

4 0 CGCTGGCCAA TCCACCGCAA CCCACCTGCT TTCGATCAGC TTGAGTCTAA 350 

GACTCAAATG TTTGAAACAG GCATTAAGGT TATCGATTTG CTTACGCCTT 4 00 

ACGTTCAGGG CGGAAAGATT GGTCTGTTCG GTGGTGCAGG CGTTGGTAAA 4 50 

ACTGTGTTGA TTCAGGAGAT GATTCAGCGC GTTGCACAGA ACCACGGCGG 500 

TGTGTCTGTG TTTGCTGGCG TTGGCGAACG TAC TCGTGAG GGTAACGATT 550 

4 5 TGATTGG CGA AATGGCTGAG GCTGGCGTTT TGGAGAAAAC AGCGCTTGTC 600 

TTTGGTCAGA TGGATGAGCC TCCTGGGACT CGTCTTCGTG TGCCTCTTAC 650 

TGCTTTGACT ATGGCTGAGT ATTTC CGTGA TGTTCAGAAT CAGGATGTGT 700 

TGC TGTTTAT CGACAACATC TTCCGCTTTA CTCAGGCAGG TTCTGAGGTT 750 

TCCACGTTGC TTGGTCGTAT GCCTTCTGCA GTTGGTTATC AGCCAAACTT 8 00 

50 GGCGGATGAA ATGGGTGCGT TGCAGGAGCG CATTACTTCT ACGCGCGGTC 850 

ATT CTATTAC GTCG 864 



55 2) INFORMATION FOR SEQ ID NO: 305 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 84 8 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 
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10 



30 



40 



(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gemella haemolysans 

(B) STRAIN: ATCC 10379 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305 



TCGAATCAGG GCATATGCCA AATCTATTAA ACGCTTTAGA AGTTTACATA 50 

GAAAAAGGCG ATGGGAAAAA AGAAAAATTA GTTCTTGAAG TTTCTCTTGA 10 0 

AATTGGTGAT AACGTAGTAA GAACAATCGC TATGTCATCT ACTGATGGAT 150 

TAAATAGGGG AGCAGAAGTA GTAGATACAG GAGCACCAAT TACAGTTCCT 200 

15 GTAGGTAACT ACACATTAGG TCGTGTGTTC AACGTATTAG GTGAAGCAGT 25 0 

TGAC CACGGT GAAGAAGCAG GAGCAGAAGT TCGTAAAGAT TCAATTCACA 300 

AAGAAGCTCC AACATTCGAT GAATTATCAA CTCACGTTGA GGTTCTTGAA 350 

ACAGGTATTA AAGTTATCGA CTTACTTGCA CCATATATTA AAGGTGGTAA 400 

AATCGGTCTT TTCGGTGGTG CGGGAGTTGG TAAAACGGTT CTTATCCAAG 4 50 

2 0 AACTTATCAA CAACGTTGCG CAACAACACG GTGGATTATC AGTATT CACA 500 

GGTGTAGGTG AGCGTACTCG TGAAGGAAAT GACTTATACT ATGAAATGAA 550 

AGATTCTGGT GTTATTAACA AAAC AGC CAT GGTATTCGGA CAAATGAACG 600 

AACCACCAGG TGCTCGTATG CGTGTAGCAT TAACAGGATT AACAATGGCG 650 

GAATACTTCC GTGATGAAGA AGGACAAGAC GTGCTTCTAT TCATCGATAA 700 

25 CATTTTCCGT TTCACACAAG CAGGTTCTGA GGTTTCTGCG TTATTAGGAC 750 

GTATGCCATC AGCCGTTGGT TACCAACCAA CACTTGCTAC AGAGATGGGA 800 

CGTTTACAAG AACGTATAAC ATCAACTAAA AAAGGTTCTG TTACATCT 84 8 



2 ) INFORMATION FOR SEQ ID NO: 3 06 



( i ) SEQUENCE CHARACTERI STICS : 
(A) LENGTH: 848 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gemella morbillorum 

(B) STRAIN: ATCC 2 7824 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306 

TCGAATCAGG GCATATGCCT AATCTACTAA ACGCTTTAGA AGTTTATATA 
GAAAAAGGCG ATGGAAAAAA AGAAAAATTA GTTCTTGAAG TTTCTCTTGA 100 
AATCGGGGAT AATGTCGTAA GAACTATTGC GATGTCATCT ACTGATGGAT 150 

50 TAAACAGAGG GGCAGAAGTA GTTGATACTG GAGCGCCAAT TACAGTGCCA 2 00 

GTAGGTAACT ATACATTAGG ACGTGTGTTC AACGTATTAG GTGAAGCAGT 250 
TGACCACGGA GAAGAAGCTG GAGCAGAAGT TCAAAAAGAA TCTATTCATA 3 00 

AAGAAGCTCC AACTTTCGAA GAATTATCAA CACATGTTGA GGTATTAGAA 3 50 

ACAGGTATTA AAGTTATCGA CCTTCTTGCA CCATATATTA AAGGTGGTAA 

55 GATTGGACTA TTCGGTGGTG CTGGAGTTGG GAAAACAGTT CTTATCCAAG 
AACTTATTAA CAACGTAGCA CAACAACACG GAGGACTTTC AGTATTTACT 
GGGGTAGGTG AACGTACTCG TGAGGGTAAC GACTTGTACT ATGAAATGAA 
AGACTCTGGA GTTATTAATA AAACTGCCAT GGTATTTGGT CAAATGAATG 600 
AGCCACCAGG TGCACGTATG CGTGTTGCCT TAACAGGATT AACAATGGCA 650 

60 GAGTACTTCC GTGATGAAGA AGGACAAGAC GTACTATTAT TTATCGATAA 70 0 

314 



400 
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TATCTTCCGT TTCACACAAG CAGGGTCTGA GGTATCTGCA TTATTAGGGC 750 
GTATGCCTTC AGCCGTTGGA TATCAACCAA CTCTTGCAAC AGAAATGGGA 80 0 

CGTCTTCAAG AACGTATTAC ATCAACTAAA AAAGGATCTG TTACATCT 84 8 



2 ) INFORMATION FOR SEQ ID NO: 307 

( i ) SEQUENCE CHARACTERISTICS : 
10 (A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus ducreyi 

(B) STRAIN: DSM 8925 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307 



GATGCAGTAC CAAAAGTATA TGATGCTTTA AAAGTTGAAT CAGGTTTAAC 50 

CTTAGAAGTT CAACAACAAT TAGGTGGTGG TTTAGTACGT TGTATCGCAT 100 

25 TAGGTACCTC AGATGGTTTA AAGCGTAGCT TAAAGGTTGT AAATACAGGT 150 

AACCCTATTC AAGTTCCTGT AGGCAC TAAA ACATTAGGCC GTATTATGAA 2 00 

TGTATTAGGC GAACCAATTG ATGAAAAAGG ACCTATTAGC GAAGAAGCTC 250 

GTTGGGATAT TCATCGTGCG GCTCCAAATT ATGAAGAACA GTCAAATAGT 3 00 

ACTGAATTAC TTGAAACCGG TATCAAAGTT ATTGACTTAA TTTGTCCATT 3 50 

3 0 TGCAAAAGGT GGTAAAGTCG GCTTAT TTGG TGGAGCTGGT GTAGGTAAAA 4 00 

CCGTTAATAT GATGGAATTG ATC CGTAATA TTGCTATTGA GCACTCAGGT 4 50 

TATTCGGTTT TTGCTGGTGT AGGTGAGCGT ACTCGTGAAG GTAATGATTT 500 

TTATCATGAA ATGACGGATT CTAATGTATT AGATAAAGTA TCACTAGTAT 550 

ATGGTCAAAT GAATGAACCA CCAGGTAACC GCCTACGTGT TGCGTTAACA 600 

3 5 GGTTTAACTA TGGC TGAAAA ATTCCGTGAT GAAGGTCGTG ATGTATTATT 650 

TTTCGTAGAT AATATTTATC GTTATACTTT AGC CGGTACA GAAGTTTCTG 7 00 

CTTTATTAGG CCGTATGCCA TCAGCGGTAG GTTATCAACC AACCCTTGCA 750 

GAAGAAATGG GTGTATTACA AGAACGTATT ACCTCAACTA AAACTGGTTC 80 0 

AATCACGGCA GTA 813 

40 



2) INFORMATION FOR SEQ ID NO: 3 08 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

•50 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus haemolyticus 
55 (B) STRAIN: ATCC 33390 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308 

TGAATTTC CA CAAGATGCAG TGCCAAAAGT TTACGATGCA TTAAAAGTTG 50 
60 AATCAGGTTT AACACTTGAG GTGCAACAAC AATTAGGTGG CGGTGTGGTA 100 
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CGTTGTATCG CATTAGGTGC TTCTGACGGT TTAAAACGTG GTTTAAAAGT 150 

AGAAAACACG AATGATCCGA TTCAAGTACC GGTAGGCACA AAAACCCTTG 200 

GTCGTATCAT GAATGTATTG GGTGAACCAA TTGACGAACA AGGTCCAATC 250 

GGTGAAGAAG AGCGTTGGGC TATCCATCGT TCTGCACCAA GCTATGAAGA 300 

5 ACAATCAAAC AGTACGGAAT TAT TAGAGAC TGGTATCAAA GTTATCGACT 350 

TAATTTGTCC ATTCGCAAAA GGTGGTAAAG TTGGTCTATT CGGTGGTGCG 400 

GGTGTAGGTA AAAC CGTTAA CATGATGGAA TTAATCCGTA ACATCGCGAT 450 

CGAGCACTCA GGTTACTCCG TATTTGCGGG TGTAGGTGAA CGTACTCGTG 50 0 

AAGGTAACGA CTTCTATCAT GAAATGAAAG ATTCTAACGT ATTAGATAAA 550 

10 GTATCTTTGG TTTATGGTCA GATGAATGAG CCACCAGGTA ACCGTTTACG 600 

TGTTGCGTTA ACTGGTTTAA CCATGGCAGA AAAATTCCGC GATGAAGGTC 650 

GTGATGTATT ATTCTTCGTG GATAATATCT ATCGTTATAC CCTTGCTGGT 70 0 

ACGGAAGTAT CTGCGTTATT AGGTCGTATG CCATCTGCGG TAGGTTACCA 750 

ACCAACTCTT GCTGAAGAAA TGGGTGTGTT ACAAGAACGT ATCACTTCAA 800 

15 CCAAAACAGG TTCTATTACA TCTGTA ~ ' * " 826 
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2) INFORMATION FOR SEQ ID NO: 309 



55 2) INFORMATION FOR SEQ ID NO: 310 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 
g 0 (C) STRANDEDNESS : Double 



(i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 809 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Haemophilus parahaemolyticus 

(B) STRAIN: ATCC 10014 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309 

3 5 GATGCAGTAC CAAAAGTATA TGATGCGTTA AAAGTTGAAT CAGGTTTAAC 50 

GCTTGAAGTT CAACAACAAT TAGGCGGTGG CTTAGTGCGC TGTATCGCAT 10 0 

TAGGTACGTC TGATGGTTTA AAACGTGGCT TAAAAGTAGA AAATACAGGC 150 

AACCCAATTG AAGTGCCAGT GGGCACTAAA ACCCTTGGTC GTATTATGAA 200 

CGTATTGGGT GAGCCGATTG ACGAAAAAGG TCCTATCGGT GAAGAAGCAC 250 

4 0 G CTGGG CAAT CCACCGTGCA GCACCAAGCT ACGAAGAGCA ATCAAATAGC 300 

ACGGAATTAC TCGAAACAGG TATCAAAGTT ATCGACTTAA TCTGCC CATT 3 50 

CGCAAAAGGG GGTAAAGTTG GTTTATTTGG TGGTGCAGGT GTAGGTAAAA 4 00 

CCGTAAATAT GATGGAGTTA ATCCGTAACA TCGCGATCGA ACACTC TGGT 450 

TACTCTGTAT TTGCAGGGGT AGGTGAGCGT ACTCGTGAAG GTAATGACTT 50 0 

45 CTACCACGAA ATGACAGACT CTAACGTATT AGATAAAGTA TCGTTAGTGT 550 

ATGGTCAAAT GAACGAACCA CCAGGTAACC GTTTACGCGT AGCTTTAACA 60 0 

GGCTTAACCA TGGCGGAAAA ATTCCGCGAT GAAGGT CGTG ACGTATTATT 65 0 

CTTCGTCGAT AACATCTACC GTTATACCCT AGCAGGTACG GAAGTGTCAG 7 00 

CACTTCTCGG TCGTATGCCA TCTGCGGTAG GTTATCAGCC AACCTTAGCA 75 0 

50 GAAGAAATGG GTGTATTACA AGAGCGTATC ACTTCAACCA AAACTGGTTC 80 0 
TATCACCTC 



809 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus parainfluenzae 

(B) STRAIN: ATCC 7901 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310 



CGAATTTCCA CAAGATGCAG TACCAAAAGT TTATGATGCA TTAAAAGTTG 50 

AATCGGGTTT AACCCTTGAA GTTCAACAAC AATTAGGTGG TGGTGTGGTA 100 

CGTTGTATCG CACTGGGAGC TTCTGACGGT TTAAAACGCA GTTTAAGCGT 150 

TGAAAATACC AATAAACCAA TTTCAGTACC GGTTGGTGTA AAAACTCTCG 2 00 

15 GTCGTATTAT GAACGTATTG GGCGAACCGA TTGATGAAAG AGGTCCTATC 250 

GGTGCGGAAG AAGAATGGGC AATTCACCGT TCTACTCCAA GTTATGAAGA 3 00 

ACAGTCCAAC AGTACCGAAT TATTAGAAAC CGGTATCAAA GTTATCGACT 3 50 

TAATTTGTCC ATTCGCGAAG GGTGGTAAAG TTGGTTTATT CGGTGGTGCG 400 

GGTGTAGGTA AGAC CGTAAA TATGATGGAA TTAATCCGTA ATATTGCGAT 450 

2 0 TGAGCACTCA GGTTACTCCG TATTTGCCGG TGTAGGTGAG CGTACCCGTG 500 

AAGGTAACGA CTTCTACCAT GAAATGACAG AATCTAACGT ATTAGACAAA 550 

GTATC CCTAG TTTACGGACA AATGAATGAG CCGCCGGGTA ACCGTTTACG 600 

TGTTGCTTTA ACCGGTTTAA CCATGGCAGA AAAATTCCGT GACGAAGGTC 650 

GTGATGTATT ATTCTTCGTG GATAACATCT ATCGTTATAC CCTTGCAGGG 7 00 

25 ACTGAAGTAT CGGCACTTTT AGGCCGTATG CCATCAGCGG TAGGTTATCA 7 50 

GCCGACACTT GCAGAAGAAA TGGGTGTGTT ACAAGAACGT ATTACAT CAA 800 

CCAAAACAGG TTCTATTACT TCTG 824 



30 



2) INFORMATION FOR SEQ ID NO: 311 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 811 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hafnia alvei 

(B) STRAIN: ATCC 13337 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311 

GCCGTGCCTA AAGTGTATAA CGCACTTGAG GTGAAAGGCG GTGCCACTAA 50 

ACTGGTACTG GAAGTTCAGC AGCAGCTAGG CGGCGGCGTT GTACGCTGTA 100 

TCGCTATGGG TACTTCTGAC GGTCTGCGTC GCGGACTGGA CGTTGTTGAC 150 

50 CTGGAGCACC CGATTGAAGT CCCAGTAGGT AAAGCGACCT TAGGCCGCAT 2 00 

TATGAACGTA CTGGGTGAGC CAATTGATAT GAAGGGTGAT ATCGGCGAAG 250 

AAGATCGCTG GGCTATTCAC CGTGAAGCTC CAAGCTACGA AGAACTGTCT 3 00 

AACTCGCAAG AACTGCTGGA AACTGGTATC AAGGTAATGG AC CTGATTTG 350 

TCCGTTCGCT AAGGGCGGTA AAGTTGGTCT GTT CGGTGGT GCGGGTGTTG 400 

55 GTAAAACAGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

TCAGGTTACT CTGTATTTGC CGGCGTGGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTATTGGAC AAAGTATCAC 550 

TGGTTTATGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGTC TGACTATGGC TGAGAAGTTC CGTGACGAAG GTCGTGACGT 650 

60 ACTGCTGTTC ATCGATAACA TCTACCGTTA TACCTTGGCC GGTAC CGAAG 700 
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TATCTGCACT GTTGGGTCGT ATGCCTTCTG CGGTAGGTTA TCAGCCAACG 75 0 

CTGGCGGAAG AGATGGGTGT TCTGCAAGAA CGTATCACCT CGACCAAAAC fion 
GGGTTCAATC A °„ 

oil 



2) INFORMATION FOR SEQ ID NO: 312 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kingella klngae 

(B) STRAIN: ATCC 23330 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312 



GCGATGCTAT TCCACGCGTT TACGATGCGT TGAAACTGGT TGATGTGGAC RO 

TTGACATTAG AAGTGCAACA ACAACTGGGC GATGG CGTAG TGCGTAC CAT 100 

2 5 TGCGATGGGT AGTACCGATG GTTTGAAACG CGGCTTAGCC GTGAACAACA 150 

CAGGCGCACC TATTACAGTG CCTGTTGGTA AAGCAACATT GGGTCGTATT 200 

ATGGACGTAT TGGGTAATCC TGTTGATGAA GCAGGTCCAA TTGGTTCTGA 2 50 

CCAAACGCGT GCTATTCACC AACCAGCTCC TAAATTTGAT GAACTGTCTA 300 

GCGCAACCGA ATTGCTGGAA ACAGGCATCA AAGTGATTGA CTTGCTTTGC 3 50 

30 CCATTTGCAA AAGGTGGTAA AGTAGGTTTG TTTGGTGGTG CAGGTGTGGG 
CAAAACTGTG AACATGATGG AGTTGATTAA CAACATTGCC AAAGCGCACA 
GTGGTTTGTC TGTATTTGCA GGCGTGGGTG AACGTACTCG CGAAGGTAAT 
GACTTCTATC ACGAGATGAA AGATTCTAAC GTGTTGGATA AAGTTGC CAT 

, c S^S^ GGT CA AATGAATG AACCTCCTGG CAACCGTTTG CGCGTTGCAT 600 

3 5 TGACTGGTTT GTCTATGGCA GAACACTTCC GTGATGAAAA AGACGAAAAT 65 0 

GG £^ GGTC GCGATGTATT GTTCTTTGTG GACAACATCT ATCGCTACAC 70 0 

ATTGGCAGGT ACAGAAGTAT CGGCATTGCT GGGTCGTATG CCCTCTGCGG 75 0 

TAGGTTATCA AC CAACATTG GCAGAAGAAA TGGGTCGTTT GCAAGAGCGT 80 0 

ATTACTTCAA CGCAAACAGG TTCGATTACT T 800 



400 
450 
500 
550 



40 



831 



50 



2) INFORMATION FOR SEQ ID NO: 313 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. ozaenae 
55 (B) STRAIN: ATCC 11296 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313 

ATGCCGTACC ACGCGTGTAC GAAGCCCTTG AGGTACAGAA TGGTAATraa en 
60 GTTCTGGTGC TGGAAGTTCA GCAGCAGCTG gS?ggSS£ 10 2 
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CATCGCCATG GGTTCTTCTG ATGGTCTGCG CCGCGGTCTG GATGTAAAAG 150 

ACCTCGAGCA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC GCTGGGTCGT 2 00 

ATCATGAACG TACTGGGTCA ACCGGTTGAC ATGAAAGGCG ACATCGGCGA 250 

AGAAGAG CGT TGGGCTATCC ACCGCGCGGC ACCGTCCTAT GAAGAGCTGT 3 00 

5 CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT CGACCTGATG 350 

TGTCCGTTCG CCAAGGGCGG TAAAGTTGGT CTGTTCGGCG GTGCGGGTGT 4 00 

AGGTAAAACT GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAGC 450 

ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAGCGTAC TCGTGAGGGT 500 

AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG ATAAAGTATC 550 

10 CCTGGTGTAC GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 6 00 

CGCTGACCGG CCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTACTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG CCGGTACTGA 700 

AGTATC CGCG CTGCTGGGTC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 75 0 

CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC CTCCACCAAA 800 

15 ACCGGTTCTA TC 812 



2) INFORMATION FOR SEQ ID NO: 314 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

3 0 (A) ORGANISM: Klebsiella ornl thinolytica 

(B) STRAIN: ATCC 31898 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314 

35 ATGC CGTACC GCGCGTGTAC GATGCTCTTG AGGTACAGAA TGGTAATGAG 50 

AGCCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGTGGTA TCGTACGTGC 100 

TATCGCCATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG GAAGTTAAAG 150 

ACCTTGAGCA CCCGATCGAA GTCCCGGTTG GTAAAGCAAC GCTGGGTCGT 200 

ATCATGAACG TGCTGGGTCA GCCAATCGAT ATGAAAGGCG ACATCGGCGA 2 50 

4 0 AGAAGAGCGT TGGGCTATTC ACCGTGCAGC TCCGTCCTAT GAAGAGCTGT 3 00 

CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT CGACCTGATG 3 50 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCGGGTGT 4 00 

AGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAGC 450 

ACTCCGGTTA CTCCGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT 500 

45 AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTCTGG ATAAAGTATC 550 

CCTGGTTTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

CTCTGACCGG CCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTTCTGCTGT TCGTCGATAA CATCTATCGT TATACCCTGG CCGGTACTGA 700 

AGTATC CGCA CTGCTGGGTC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

50 CCCTGGCGGA AGAGATGGGT GTTCTGCAGG AACGTATCAC CTCCACCAAA 800 

ACCGGTTCTA TC 812 



55 2) INFORMATION FOR SEQ ID NO: 315 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 
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10 



40 



(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 33496 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315 



GTACCGCGCG TGTACGAGGC TCTTGAGGTA CAAAATGGTA GTGAGAATCT 5n 

GGTGCTGGAA GTTCAGCAGC AGCTCGGCGG CGGTATTGTT CGTACCATCG 100 

CCATGGGTTC TTCCGACGGT CTGCGTCGCG GTCTGGAAGT CAAAGACCTC III 

GAGCATCCGA TCGAAGTCCC GGTAGGTAAA GCAACGCTGG GTCGTATCAT 200 

15 GAACGTACTG GGCCAACCGG TAGACATGAA AGGCGACATC GGCGAAGAAG 250 

AGCGTTGGGC GATTCACCGC GCAGCGCCTT CCTACGAAGA GTTGTCAAAC 3oS 

^TCAGGAAC TGCTGGAAAC CGGCATCAAA GTTATCGACC TGATGTGTCC 350 

GTTTGCGAAG GGCGGTAAAG TTGGTCTGTT CGGTGGTGCG GGTGTAGGTA III 

AAACCGTAAA CATGATGGAG CTGATCCGTA ACATCGCGAT CGAGCACTCC ill 

20 GGTTACTCCG TGTTTGCGGG CGTAGGTGAA CGTACTCGTG AGGGTAACGA 500 

CTTCTACCAC GAAATGACCG ACTC CAACGT TATCGATAAA GTAtSotgG* III 

TGTATGGCCA GATGAACGAG CCGCCGGGAA ACCGTCTGCG CGTTGcSctg 60S 

ACCGGCCTGA CCATGGCTGA GAAGTTCCGT gacgaaggtc GTGACGTTCT III 

GCTGTTCGTC GATAACATCT ATCGTTACAC CCTGGCCGGT ACTGAAGTAT nil 

25 £COCACTGCT GGGTCGTATG CCTTCAGCGG TAGGTTACCA GCCGAcIStG nil 

SSSSSS CGTTCT GCA ^AACGT ATCACCTCCA C C AAAACGGG III 

813 

30 

2 ) INFORMATION FOR SEQ ID NO: 316 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 822 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella planticola 

(B) STRAIN: ATCC 33531 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316 

GATGCCGTAC CGCG CGTGTA CGATGCTCTT GAGGTACAGA ATGGTAATGA 
SSSEE™ CTGGAA GTTC AGCAGCAGCT CGGCGGTGGT ATCGTACGTG 
Kn C ^ G ? CCAT GGGTTCTTCT GACGGTCTGC GTCGTGGTCT GGAAGTTaS 
50 GACCTTGAGC ACCCGATCGA AGTCCCGGTT GGTAAAGCAA CGCTGGGTCG 

tatcatgaac gtgctgggtc agccgatcga tatgaaaggc SSSoggcq 

AAGAAGAGCG TTGGGCTATT CACCGCGCAG CTCCGTCTTA TGAAGaSg 

tccagttctc aggaactgct ggaaaccggc atcaaagtta ?cgac??S? 
cc gtgtccgttc gctaagggcg gtaaagtagg tctgttcgg? ggtgcgggcg 

55 TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCgSg 
CACTCCGGTT ACTCCGTGTT TGCGGGCGTC GGTGAACGTA CTCGTGAGGG 
^GACTTC TACCACGAAA TGACCGACTC CAAC^TCtS GA?SaG*S5 
CCC 1 GGTTTA TGGCCAGATG AACGAGCCGC CGGGAAACCG TCTGCGCGT? 
GCTCTGACCG GCCTGACCAT GGCTGAGAAA TTCCGTGACG AAGGTrrTPi 

cgttctgctg ttcgtcgata acatctatcg t?a?a?c^?g g^?gg¥a??S 6 7 5 o°o 



60 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
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AAGTATCCGC ACTGCTGGGT CGTATGCCTT CAGCGGTAGG TTATCAGCCG 75 0 

ACCCTGGCGG AAGAGATGGG TGTTCTGCAG GAACGTATCA CCTCCACCAA 80 0 

AACCGGTTCT ATCACTTCCG TA 822 



2) INFORMATION FOR SEQ ID NO: 317 

(i) SEQUENCE CHARACTERISTICS : 
10 (A) LENGTH: 785 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 13883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317 



20 



AGAATGGTAA TGAAGTTCTG GTGCTGGAAG TTCAGCAGCA GCTGGGCGGC 50 

GGTATCGTAC GTACCATCGC CATGGGTTCT TCTGATGGTC TGCGCCGCGG 10 0 

25 TCTGGATGTA AAAGACCTCG AGCACCCGAT CGAAGTCCCG GTAGGTAAAG 15 0 

CAACGCTGGG TCGTATCATG AACGTACTGG GTCAACCGGT TGACATGAAA 2 00 

GGCGACATCG GCGAAGAAGA GCGTTGGGCT ATCCACCGCG CGGCACCGTC 2 50 

CTATGAAGAG CTGTCCAGCT CTCAGGAACT GCTGGAAACC GGCATCAAAG 3 00 

TTATCGACCT GATGTGTCCG TTCGC CAAGG GCGGTAAAGT TGGTCTGTTC 350 

3 0 GGCGGTGCGG GTGTAGGTAA AACTGTAAAC ATGATGGAGC TGATC CGTAA 4 00 

CATCGCGATC GAGCACTCCG GTTACTCTGT GTTTGCGGGC GTAGGTGAGC 4 50 

GTACTCGTGA GGGTAATGAC TTCTACCACG AAATGACCGA CTCCAACGTT 500 

ATCGATAAAG TATCCCTGGT GTACGGCCAG ATGAACGAGC CGCCGGGAAA 550 

CCGTCTGCGC GTTGCGCTGA CCGGCCTGAC CATGGCTGAG AAATTCCGTG 600 

35 ACGAAGGTCG TGACGTACTG CTGTTCGTCG ATAACATCTA TCGTTACACC 650 

CTGGCCGGTA CTGAAGTATC CGCACTGCTG GGTCGTATGC CTTCAGCGGT 700 

AGGTTATCAG CCGACCCTGG CGGAAGAGAT GGGCGTTCTG CAGGAACGTA 750 

TCACCTCCAC CAAAACCGGT TCTATCACCT CCGTA 785 

40 

2) INFORMATION FOR SEQ ID NO: 318 

(i) SEQUENCE CHARACTERISTICS: 

4 5 (A) LENGTH: 759 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera ascorbata 

(B) STRAIN: ATCC 33433 



55 



(xi) SEQUENCE DESCRIPTION: SEQ IDNO: 318 



CTGGTGCTGG AAGTTCAGCA GCAGCTCGGC GGCGGTATCG TACGTWCCAT 50 
CGCTATGGGT TCTTCCGACG GTCTGCGTCG CGGTCTGGAT GTTAAAGATC 100 
60 TCGAGCACCC AATCGAAGTT CCGGTMGGTA AAGCAACMCT GGGTCGTATC 150 
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ATGAACGTAC TGGGTCAKCC AGTMGACATG 
AGAGCGTTGG GCTATCCACC GCGCTGCACC 
GCTCTCAGGA ATTGCTGGAA ACCGGTATCA 
CCGTTCGCTA AGGGCGGTAA AGTCGGTCTG 
5 TAAAACCGTA AACATGATGG AGCTGATCCG 
CCGGTTACTC CGTGTTTGCG GGCGTAGGTG 
GACTTCTACC ACGAAATGAC CGACTCCAAC 
GGTATATGGC CAGATGAACG AGCCACCGGG 
TGACCGGTCT GACCATGGCT GAGAAATTCC 
10 CTGCTGTTCG TCGATAACAT CTATCGTTAC 
ATCTGCWCTG CTGGGTCGTA TGCCTTCAGC 
TGGCGGAAGA GATGGGCGTT CTGCAGGAAC 
GGTTCTATC 



AAAGGCGACA TCGGTGAAGA 2 00 

TTCCTACGAA GAGCTGTCTA 250 

AAGTTATCGA C CTGATGTGT 3 00 

TTCGGTGGTG CSGGTGTTGG 3 50 

TAACATC GCG ATCGAGCACT 4 00 

AACGTACTCG TGAGGGTAAC 4 50 

GTTAT CGATA AAGTATCCCT 5 00 

AAACCGTCTG CGCGTTGCTC 550 

GTGACGAAGG TCGTGACGTA 60 0 

ACCCTGGCCG GTACTGAAGT 650 

GGTAGGTTAC CAGCCGACCC 700 

GTATCACCTC CACCAAGACC 750 

759 



2) INFORMATION FOR SEQ ID NO: 319 

(i) SEQUENCE CHARACTERISTICS : 
2 0 (A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera. cryocrescens 

(B) STRAIN: ATCC 3 3435 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319 



TTCCCTCAGG 
TGGTAATGAA 

3 5 TCGTACGTAC 

GATGTAAAAG 
ACTGGGTCGT 
ACATCGGTGA 
GAAGAGCTGT 

4 0 CGACCTGATG 

GTGCGGGTGT 
GCGATTGAGC 
TCGTGAGGGT 
ATAAAGTTTC 
4 5 CTGCGCGTTG 
AGGTCGCGAC 
CCGGTACAGA 
TACCAGCCGA 
CTCCACCAAA 

50 



ATGCCGTACC 
GTGCTGGTGC 
CATCGCTATG 
ACCTCGAGCA 
ATCATGAACG 
AGAAGAACGT 
CAAGCTCTCA 
TGTCCGTTTG 
AGGTAAAACC 
ACT C CGGTTA 
AACGACTTCT 
CCTGGTTTAC 
CGCTGACTGG 
GTACTGCTGT 
AGTATCTGCA 
CTCTGGCGGA 
ACCGGTTCTA 



GCGTGTGTAC 
TGGAAGTTCA 
GGTTCTTCCG 
CCCGATCGAA 
TACTGGGCCA 
TGGGCTATCC 
GGAACTGCTG 
CGAAGGGCGG 
GTAAACATGA 
TTCTGTGTTT 
ACCACGAAAT 
GGCCAGATGA 
TCTGACTATG 
TCGTCGATAA 
CTGCTGGGTC 
AGAGATGGGC 
TCACCTCCGT 



GAAGCCCTTG 
GCAGCAGCTC 
ACGGTCTGCG 
GTCCCGGTAG 
AC CGGTAGAC 
ACCGTGCAGC 
GAAACCGGCA 
TAAAGTTGGT 
TGGAGCTTAT 
GCGGGCGTAG 
GACCGACTCC 
ACGAGCCACC 
GCTGAGAAGT 
CAT CTATCGT 
GTATGCCTTC 
GTTCTGCAGG 
A 



AGGTTCAGAA 50 

GGC GGCGGTA 100 

TCGTGGTCTG 150 

GTAAAGCAAC 2 00 

ATGAAAGGCG 250 

ACCTTCCTAC 3 00 

T CAAAGTT AT 35 0 

CTGTTCGGTG 4 00 

TCGTAACATC 450 

GTGAACGTAC 500 

AACGTTATCG 55 0 

AGGAAACCGT 600 

TCCGTGACGA 650 

TACAC CCTGG 70 0 

AGCGGTAGGT 750 

AACGTATCAC 800 
831 



2) INFORMATION FOR SEQ ID NO: 320 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera georgiana 
5 (B) STRAIN: ATCC 51603 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320 

GCCGTACCGC GCGTGTACGA AGCCCTTGAG GTACAGAATG GTAATGAAGT 50 

10 GCTGGTGCTG GAAGTTCAGC AGCAGCTCGG TGGCGGTATC GTGCGTACCA 10 0 

TCGCCATGGG TTCCTCCGAC GGTCTGCGTC GCGGTCTGGA AGTTAAAGAT 15 0 

CTCGAGCACC CGATCGAAGT TC CGGT AGGT AAAGCAACAC TGGGTCGTAT 200 

CATGAACGTA CTGGGTCACC CGGTAGACAT GAAAGGCGAC ATCGGTGAAG 250 

AAGAGCGTTG GGCTATCCAC CGCGCTGCGC CTTCCTACGA AGAGCTGTCC 300 

15 AGCTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGCGGT GCGGGTGTTG 400 

GTAAAAC CGT AAACATGATG GAG CTGATC C GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAT AAAGTATCCC 55 0 

2 0 TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGCC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT 6 50 

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACTGAAG 7 00 

TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA CCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAGAC 80 0 

25 CGGTTCTATC 810 



2) INFORMATION FOR SEQ ID NO: 321 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Lactobacillus acidophilus 

(B) STRAIN: ATCC 43 56 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321 

45 TCGATAAGAA TTTACCTGAT ATTAACAACG CCTTACGTGT AATCAAGTCC 50 

GAAGATGAAA GCATCGTTCT TGAAGTTACA CTTGAACTCG GTGATGGTGT 100 

TTTAAGAACA ATCGCCATGG AATCTAC CGA TGGTCTTCGT CGTGGTATGA 150 

AAGTCGAAGA TACTGGCGCT CCAATTTCAG TTCCAGTTGG AGAAGACACT 200 

TTAGGT CGTG TGTTTAACGT TTTAGGACAG CCTATTGATG GTGGTCCAGC 2 50 

50 CTTTCCAAAG GATCACCCAC GTGAGGGTAT CCACAAGGAA GCACCTAAAT 3 00 

ATGAAGATTT AACTACTAGT CGTGAAATTC TTGAAACTGG TATCAAGGTT 3 50 

ATCGACCTTC TTGAACCATA TGTTCGTGGT GGTAAAGTTG GTTTGTTTGG 4 00 

TGGTGCCGGT GTTGGTAAAA CTACTATTAT TCAAGAATTA ATTCACAACA 4 50 

TCGCTCAAGA ACACGGTGGT ATTTCCGTAT TTACTGGTGT TGGTGAAAGA 50 0 

55 ACTCGTGAAG GTAATGACCT TTACTTTGAA ATGAAAGCTT CAGGCGTTTT 55 0 

AAGTAAGACT GCCATGGTAT TTGGT CAGAT GAACGAGCCG CCTGGTGCCA 600 

GAATGCGTGT TGCATTAACC GGTTTGACAC TTGCTGAATA CTTTAGAGAT 65 0 

GTTGAAGGTC AAGACGTATT GCTCTTTATT GACAATATCT TTAGATTTAC 700 

TCAGGCTGGT TCAGAGGTAT CTGCTTTGCT TGGTCGTATG CCAAGTGCCG 75 0 

60 TAGGTTATCA GCCAACTTTG GCAACAGAAA TGGGTCAATT G CAGG AAAG A 8 00 

323 
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ATTACTTCTA CTAAGAAGGG TTCAATTACT TCAA 



5 2) INFORMATION FOR SEQ ID NO: 322 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Legionella pneumophila subsp. pneumophila 

(B) STRAIN: ATCC 33152 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322 

20 

TTCCTCGTGA TAGCGTGCCT AAAGTCAATG ATGCGTTAAA GCTTGTTGAT 50 

AGTGATCTGG TTTTTGAAGT GCAGCAGCAA CTTGGAGACG GAGTTGTGCG 100 

TACTATTGCC ATGGGAACAA CCGATGGTTT AAAGCGAGGA TTAAAAGCAG 150 

AAAATACAGG CCATCCTATT CAAGTGCCAG TAGGTAAGAA AACTTTGGGA 2 00 

25 CGCATTATGG ATGTTCTTGG GCGTCCTGTA GATGATGCTG GGCCTATCGA 250 

TGCTGAAGAG ACTTGGGCTA TT CAT CGTAA AG CAC CAAGT TATGAAGAGC 300 

AAGCTGGCAG CCAGGAATTA TTGGAAACTG GTATTAAAGT AATTGATTTG 35 0 

CTTTGCCCTT TTGCCAAGGG AGGTAAAGTT GGTCTATTCG GTGGTGCCGG 40 0 

TGTAGGCAAA AC CGTTAACA TGATGGAATT AATACGAAAC ATTGCAATTG 4 50 

3 0 AGCATAGCGG TTATTCAGTG TTTGCAGGGG TTGGTGAACG TAC CCGTGAA 500 

GGAAACGACT TCTATCATGA GATGAAAGAC TCTAATGTAT TGGATAAAGT 550 

ATCGCTTGTT TATGGT CAGA TGAATG AGC C GCCAGGAAAC CGTTTGCGTG 600 

TTGCTCTAAC CGGTTTGACT ATGGCTGAAA AATTCCGGGA TGAAGGGCGA 65 0 

GACGTTCTTT TGTTTATCGA TAATATTTAT CGTTATACCT TGGCTGGGGT 700 

3 5 TGAAGTATCT GCGCTGTTAG GCCGTATGCC TTCTGCAGTA GGATATCAGC 750 

CGACATTAGC AGAGGAAATG GGTATGCTGC AAGAG CGCAT TACCTCCACA 800 

AAAACAGGTT CTATTACTT C CATA 824 



40 

2) INFORMATION FOR SEQ ID NO: 323 



45 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Doubl e 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



50 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leminorella grimontii 

(B) STRAIN: ATCC 33999 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323 



60 



GACGCCGTAC CGAAAGTGTA CGATGCGCTT GAAGTTCAAA TTGATGCCAA 
GCTGGTTCTG GAAGTTCAAC AGCAGCTCGG CGGCGGCGTT GTTCGCTGCA 
TCGCGATGGG TACTTCAGAC GGCTTAAGCC GCGGTCTGGA CGTGCTCGAT 
CTGGAACACC CGATTGAAGT ACCGGTGGGC AAAGCGACGC TGGGCCGCAT 



50 
100 
150 
200 
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raTPAACGTG CTTGGTCACC CTATCGACAT GAAGGGCGAC ATCGGCGAAG 2 50 

SJS^TTC GGCTATTCAC CGCGCAGCGC CGAGCTACGA AGACCTGTCG 3 00 

G^cScScJg SsCTOCTOGA GACCGGCATC AAGGTTATCG ACCTGATTTG 350 

S^TTCGCC AAGGGCGGTA AAGTCGGCCT GTTCGGCGGC GCCGGCGTAG 400 

c- 5tSaIc2gT AAACATGATG GAGCTCATTC GCAACATTGC GACCGAGCAC 450 

TCC^TACT CCGTGTTTGC AGGCGTAGGT GAACGTACCC GTGAGGGTAA 500 

ScT^CTAC ScGAAATGA CTGAATCCAA CGTATTGGAC AAGGTGTCGC 550 

SStATACGG TCAGATGAAC GAGCCGCCTG GAAACCGTCT GCGCGTAGCG 600 

??S5gGGCT TGACCATGGC GGAGAAGTTC CGTGATGAAG GCCGTGACGT 650 

10 TCtcScStT ATCGACAACA TTTACCGCTA TACCCTGGCC GGTACGGAAG 700 

StCCGCACT GCTGGGC CGT ATGCCTTCAG CCGTAGGCTA CCAGCCGACT 750 

CTGgSgaS AAATGGGCGT GCTTCAAGAG CGTATTACCT CTACCAAGAC 800 
GGGGTCTATC ACCTCCGT 



15 



50 



2 ) INFORMATION FOR SEQ ID NO: 325 



( i ) SEQUENCE CHARACTERI STICS : 
55 (a) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 

325 



2) INFORMATION FOR SEQ ID NO: 324 

( i ) SEQUENCE CHARACTERI STICS : 
20 (A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324 

GTGGAAACTT AC CTGAAATC TACAATGCCC TAGTTATTGA ATATAAATCT 50 

GATGCAGAAG AAGC AC CAAC TAGCCAACTT ACTTTAGAAG TAGCCATCCA 100 

ATTAGGTGAT GATGTTGTAC GTACAATCGC AATGGCATCA ACAGATGGTG 150 

^ TTCAAAGAGG TATGGAAGTT ATTGATACTG GGAGCC CAAT TACAGTTCCT 2 00 

GTAGGTACAG TAACTCTTGG TCGTGTATTT AATGTATTAG GAAACACCAT 250 

CGATTTGGAC GAACCACTTC CAAGCGATAT TAAACGTAAT AAAATTCACC 3 00 

GCGAAGCTCC AACTTTCGAT CAATTAGCAA CGACAACAGA AATT CTTGAA 350 

ACAGGAATTA AAGTAGTTGA CTTGCTAGCT CCTTACTTAA AAGGTGGTAA 400 

4 0 AATCGGATTG TTCGGTGGTG CCGGTGTTGG TAAAACCGTT CTAATTCAAG 450 

AGCTTATCCA TAATATCGCA CAAGAACATG GTGGTATTTC TGTGTTCGCT 500 

GGCGTTGGAG AACGTACTCG TGAAGGTAAC GACCTTTACT TCGAAATGAA 550 

AGATTCAGGC GTTATTGAAA AAACAGCGAT GGTATTCGGT CAAATGAACG 600 

AGCCAC CAGG TGCGCGTATG CGTGTTGCCT TAACTGGTCT AACAATTGCT 650 

45 GAATATTTCC GTGATGAAGA ACATCAAGAT GTACTTTTAT TCATTGATAA 700 

CATTTTCCGT TTCACACAAG CTGGTTCAGA GGTTTCGGCT TTACTAGGTC 750 

GTATGCCATC TGCGGTAGGT TAC CAACCAA CCCTAGCTAC TGAAATGGGT 800 

CAACTACAAG AACGTATTAC ATCTACTAAC GTTGG 835 



CA 02283458 1999-09-28 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Micrococcus lylae 

(B) STRAIN: ATCC 27566 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325 

CCCGCGTGGC GAGTTGCCGG CACTGTTCAA CGCGCTGACT GTCGAGGTCA 5 0 

CCCTCGAAGC AGTCGCTAAA ACCATTACCC TTGAGGTTGC TCAGCACCTC 10 0 

10 GGCGACAACT TGGTTCGTGC CGTGTCCATG GCACCGACCG ACGGTCTCGT 150 

CCGTGGCGCT GCTGTGATCG ACAGCGGTAA GCCGATCTCA GTTCCCGTTG 200 

GTGACGTAGT CAAGGGACAC GTCTTCAACG CTCTGGGTGA TTGCCTCGAT 250 

GAGCCAGGTC TTGGCCGTGA CGGTGAGCAG TGGGGCATCC ACCGCGATCC 3 00 

GCCACCTTTT GACCAGCTTG AGGGTAAGAC CGAGATTCTG GAAACCGGTA 350 

15 TTAAGGTCAT CGAC CTGCTG ACCCCGTATG TTAAGGGCGG CAAGATCGGC 400 

CTGTTCGGTG GTGCTGGTGT GGGTAAGACC GTTCTTATCC AGGAAATGAT 450 

CACCCGTATC GCTCGCGAGT TCTCCGGTAC CTCGGTGTTC GCAGGCGTGG 500 

GTGAGCGTAC CCGTGAGGGC ACCGACCTCT TCCTGGAAAT GGAAGAGATG 550 

GGCGTTCTCC AGGACACCGC TCTTGTGTTC GGC CAGATGG ACGAGCCTCC 600 

20 AGGAGTT CGT ATGCGCGTGG CGCTGTCCGG CCTGACCATG GCGGAGTACT 65 0 

TCCGCGATGT GCAGCACCAG GACGTGCTTC TGTTCATCGA CAACATCTTC 70 0 

CGTTTCACCC AGGCAGGTTC CGAGGTTTCC ACCCTCCTAG GCCGCATGCC 750 

TTCTGCCGTG GGTTACCAGC CAACGCTGGC AGACGAGATG GGTGTTCTGC 800 

AGGAGCGTAT TACCTCCACA AAGGGTAA 828 

25 



2) INFORMATION FOR SEQ ID NO: 326 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Moellerella wisconsensis 
40 (B) STRAIN: ATCC 35017 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326 

GATGC CGTAC CAAAAGTGTA CGATGCTCTT GAGGTTCTTA ACGGTAAAGA 50 

45 AAAATTGGTG CTGGAAGTTC AGCAACAATT AGGCGGTGGT GTTGTTCGTT 10 0 

GTATCGCAAT GGGTACATCA GATGGTTTAA GCCGCGGTTT AGAAGTTAAA 150 

AATACAGATC ATC CGATCGA AGTTCCTGTC GGTGTTAAAA CGCTTGGCCG 200 

TATCATGAAC GTGCTGGGTG AC CCAATCGA CATGAAAGGT GATAT CGGCG 250 

AAGAAGAACG CTGGTCAATT CACCGCGCAG CACCAAGCTA TGAAGATCTG .300 

5 0 GCTAACTCAA CAGAACTTCT AGAAACAGGT ATCAAAGTTA TGGAC CTGAT 3 50 

TTGCCCATTC GCTAAAGGGG GTAAAGTGGG TCTGTTCGGT GGTGCGGGTG 400 

TCGGTAAAAC AGTTAACATG ATGGAGCTTA TTCGTAATAT CGCGATTGAG 450 

CACTCAGGTT ATTCTGTATT CGCGGGTGTT GGTGAACGTA CTCGTGAAGG 500 

TAACGATTTC TACCATGAAA TGACAGACTC AAACGTTCTG GATAAAGTTT 55 0 

55 CATTGGTTTA TGGCCAGATG AATGAGCCAC CAGGAAACCG TCTGCGTGTT 600 

GCTCTGACTG GTCTGACTAT GGCAGAGAAA TTCCGTGACG AAGGTCGTGA 65 0 

CGTACTGTTA TTCGTAGATA ATATTTATCG TTATAC CTTA GCAGGGACAG 700 

AAGTATCTGC ACTGCTGGGT CGTATGCCTT CAGCGGTGGG TTATCAGCCA 750 

ACGCTGGCGG AAGAGATGGG TGTTCTGCAA GAACGTATCA CCTCGACTAA 8 00 

60 GACCGGCTCT ATCACTTCCG TA 822 

326 
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2) INFORMATION FOR SEQ ID NO: 327 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 854 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Branhamella catarrhalis 

(B) STRAIN: ATCC 43628 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327 

2 0 CCGTGGCGAT GTCCCCCAAA TCTTTGATGC ACTTCATGTT GATGGTACTG 50 

^ AAACCACCCT TGAAGTCCAA CAACAGTTAG GTGATGGTGT GGTGCGTACC 100 

ATTGCCATGG GTTCTACCGA AGGCTTAAAG CGTGGCTTGC CTGTCTCTAA 150 

TTCAGGTGCA CCCATTTCGG TACCAGTCGG TCAAGCAACA CTGGGTCGCA 2 00 

TTATGGATGT CCTAGGTCGC CCAATCGATG AAGCAGGTCC GGTAAATGCT 250 

2 5 GAACAAAAAT GGTCCATTCA TCGTGAAGCA CCAAGTTATG ATGAACAGTC 3 00 

AAATAGTACA GAAC TTTTAG AAAC AGG CAT CAAAGTGATT GATTTGCTTT 35 0 

GTCCATTTGC CAAAGGTGGT AAAGT CGGTC TGTTCGGTGG TGCTGGTGTT 400 

GGTAAGACCG TTAACATGAT GGAGCTTATC AATAATATCG CCCTAAAACA 450 

CTCAGGTCTG TCGGTTTTTG CTGGTGTGGG TGAGCGTACT CGTGAGGGTA 500 

3 0 ATGACTT CTA CCATGAAATG CAAGAAGCAG GCGTTGTTAA TACCGAAGAT 550 

TTTACTCAGT CAAAAGTTGC CATGGTTTAT GGTCAGATGA ATGAGCCACC 60 0 

AGGAAACCGT CTGCGTGTTG CCTTAACTGG TTTGACCATG GCAGAGTATT 650 

TCCGTGATGA AAAAGACGAA GCAACGGGCA AAGGCCGTGA TGTTCTGCTG 700 

TTCGTTGATA ATATTTATCG TTACACATTG GCAGGTACTG AGGTATCAGC 750 

3 5 ACTTTTAGGT CGTATGCCAT CTGCGGTAGG TTATCAGCCG ACTTTGGCCG 800 

AAGAGATGGG CTTGCTACAA GAG CGTATCA CCTCCACCCA ATCAGGCTCA 850 
ATTA 



40 



50 



2) INFORMATION FOR SEQ ID NO: 32 8 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 831 bases 
45 (b) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



854 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Moraxella osloensis 

(B) STRAIN: ATCC 19976 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328 

CCGTCAAAGC GTACCAAGAA TTTATGATGC CTTAAAAGTT GAAGGCACAG 50 

AAACTACATT AGAAGTACAA CAACAATTGG GTGATGGTAT CGTACGTACT 100 

ATTGCCATGG GTTCTACTGA AGGTCTAAAA CGTGGTCTAC CAGTTAGCAA 15 0 

60 CACTGGCGCA CCAATCTCTG TACCTGTGGG TAAAGGTACA CTAGGTCGTA 200 
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TCATGGACGT TTTAGGACAC CCAATCGATG AGGCAGGTCC GGTAGAGCAT 250 

AGTAACACTT GGGCGATTCA CCGTGAAGCG CCAAGCTATG ATGAACAATC 3 00 

AAACTCTACT GAACTTTTAG AAAC CGGTAT TAAAGTAATT GACTTACTAT 3 50 

GCCCATTTGC TAAAGGTGGT AAAGTCGGTC TGTTCGGTGG CGCGGGTGTT 4 00 

5 GGTAAAACCG TTAACATGAT GGAACTTATC AATAACATCG CAAAAGCACA 4 50 

CTCAGGTTTA TCGGTATTTG CTGGTGTAGG TGAGCGTACT CGTGAAGGTA 500 

ATGACTTCTA CCACGAGATG AAAGACTCAA ACGTACTTGA TAAAGTTGCG 550 

ATGGTGTATG GTCAGATGAA TGAGCCACCA GGAAACCGTT TACGTGTTGC 6 00 

CCTGACAGGT TTAACCATGG CAGAATACTT CCGTGACGAA AAAGATGAAA 65 0 

10 ACGGTAAAGG TCGTGACGTA TTATTGTTCG TTGACAATAT TTATCGTTAC 7 00 

ACGCTAGCGG GTACCGAAGT ATCAGCATTA TTAGGTCGTA TGCCATCTGC 750 

AGTAGGGTAT CAGCCAACGC TTGCAGAAGA GATGGGTGTA CTACAAGAAC 800 

GTATTACTTC AACCCAATCA GGCTCTATTA C 831 



15 



30 



2) INFORMATION FOR SEQ ID NO: 32 9 



( i ) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Morganella morganii subsp. morganii 

(B) STRAIN: ATCC 25830 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329 



CGAATTTCCT CAGGATGCAG TACCGAAAGT GTACGATGCG CTTGAGGTAA 50 

CAAATGGTAA AGAAAAACTG GTGCTGGAAG TTCAGCAGCA GTTAGGCGGC 100 

35 GGGGTTGTCC GTTGTATCGC TATGGGTACA TCTGATGGTC TGAGCCGTAA 150 

TCTGGAAGTA ACCGATTTAG GCCACCCGAT CGAAGTCCCT GTCGGCGTGA 2 00 

AAACCTTAGG ACGTATCATG AACGTTCTGG GTGATCCGAT CGATATGAAA 2 50 

GGTGACATCG GCGCAGAAGA AAAATGGTCT ATTCACCGTG CTGCACCAAC 3 00 

ATACGAAGAA CTGTCTAACT CCCAGGAACT GCTGGAAACA GGTATCAAAG 3 50 

4 0 TAATGGACCT GATCTGCCCG TTCGCGAAGG GTGGTAAAGT CGGTCTGTTC 4 00 

GGTGGTGCGG GTGTGGGTAA AACCGTAAAC ATGATGGAAC TGATCCGTAA 450 

CATCGCGATC GAGCACTCCG GTTACTCTGT ATTCGCAGGG GTCGGTGAGC 500 

GTACCCGTGA AGGTAACGAC TTCTATCATG AAATGACAGA CTCCAACGTT 550 

CTGGACAAAG TATCACTCGT GTACGG CCAG ATGAACGAGC C AC CGGGAAA 600 

4 5 CCGTCTGCGC GTTGCTCTGA CCGGTCTGAC CATGGCGGAA AAATTCCGTG 650 

ATGAAGGCCG CGATGTACTG CTGTTCGTTG ATAACATCTA CCGTTATACC 700 

CTGGCCGGTA CTGAAGTATC CGCGCTGTTA GGCCGTATGC CTTCAGCGGT 750 

AGGTTAC CAG CCGACACTGG CGGAAGAAAT GGGTGTGCTT CAGGAACGTA 800 

TCACATCGAC CAAAACAGGC TCTATCACGT CTGTA 835 

50 



2) INFORMATION FOR SEQ ID NO: 330 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



328 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pantoea agglomerans 
5 (B) STRAIN: ATCC 27155 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330 

GACGCGGTAC CGCAAGTGTA CAGCGCCCTC GAGGTTATGA ATGGTGATGC 50 

10 GCGTCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGCGGC GTAGTACGTA 100 

CCATCGCAAT GGGTACGTCT GACGGCCTGA AGCGTGGTCT GAGCGTCAAC 150 

GACCTGCAGA AACCGATTCA GGTACCCGTC GGTAAAGCGA CCCTGGGCCG 2 00 

TATCATGAAC GTTCTCGGCG AGCCAATCGA TATGAAAGGC GAGCTGAAAG 250 

AAGAAGATGG CAGCGCAGTA GAGATCGCCT CTATTCACCG CGCAGCCCCT 3 00 

15 TCTTATGAAG ATCAGTCTAA CTCGCAGGAA CTGCTGGAAA CCGGCATCAA 350 

GGTTAT CGAC CTGATGTGTC CGTTTGCTAA AGGCGGTAAA GTCGGTCTGT 4 00 

TCGGTGGTGC GGGTGTAGGT AAAACCGTCA ACATGATGGA ACTGATCCGT 4 50 

AACATCGCGG CTGAACACTC AGGTTAC TCA GTGTTTGCCG GTGTGGGTGA 500 

GCGTACTCGT GAGGGTAACG ACTTCTACCA CGAAATGACT GACTCTAACG 5 50 

2 0 TTATCGATAA AGTTGCACTG GTCTATGGCC AGATGAACGA GCCGCCGGGT 600 

AACCGTCTGC GCGTAGCACT GACCGGTCTG AC C ATGGCGG AAAAATTCCG 650 

TGATGAAGGT CGCGACGTTC TGCTGTTCAT CGATAACATC TACCGTTATA 700 

CCCTGGCCGG TACAGAAGTT TCTGCACTGC TGGGTCGTAT GCCATCTGCG 750 

GTAGGTTACC AGC CAACGCT GGCAGAAGAG ATGGGTGTGT TGCAGGAGCG 8 00 

25 TATTACCTCC ACCAAGACCG GTTC 824 



30 



2) INFORMATION FOR SEQ ID NO: 331 



( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 8 08 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Pantoea dlspersa 

(B) STRAIN: ATCC 14589 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331 

4 5 TATACAGCGC TCTCGAGGTA AAAAATGGTG ATGCTCGTCT GGTGCTGGAA 50 

GTACAGCAGC AGCTGGGCGG TGGCGTGGTG CGTACCATCG CCATGGGTTC 100 

TTCTGACGGC CTGAAGCGCG GTCTGGAAGT CACCGACCTG AAAAAACCTA 150 

TCCAGGTTCC GGTTGGTAAA GCAACACTCG GCCGTATCAT GAACGTGCTG 20 0 

GGTGAGCCAA TCGACATGAA AGGCGAC CTG AAAGAAGAAG ACGGCAGCGC 250 

50 TGTAGAGGTT TCCTCTATTC ATCGCGCAGC GCCTTCTTAT GAAGATCAGT 300 

CAAACTCGCA GGAACTGCTG GAAACCGGCA TCAAGGTTAT CGACCTGATG 350 

TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG GTGCGGGTGT 4 00 

AGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC GCGGCTGAGC 450 

AC TCAGG TT A TTCGGTCTTT GCCGGCGTGG GTGAGCGTAC TCGTGAGGGT 500 

55 AACGACTTCT ACCACGAAAT GACGGACTCC AACGTTATCG ATAAAGTAGC 550 

GCTGGTGTAT GGC CAGATGA ACGAGCCGCC GGGTAACCGT CTGCGCGTAG 60 0 

CACTGACCGG TCTGACCATG GCGGAAAAAT TCCGTGATGA AGGC CGTGAC 650 

GTTCTGCTGT TCATCGACAA CATCTACCGT TACACCCTGG CCGGTACAGA 700 

GGTTTCTGCA CTGCTGGGTC GTATGC CATC GGCGGTAGGT TATCAGCCAA 750 

60 CGCTGGCTGA AGAGATGGGT GTGCTGCAGG AGCGTATTAC CTCCACCAAG 800 
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i ■ 



ACCGGTTC 808 



5 2) INFORMATION FOR SEQ ID NO: 332 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pasteurella multocida 

(B) STRAIN: NCTC 10322 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332 



GATGCAGTAC CAAAAGTATA TGATGCCTTA AATGTTGAAA CAGGTTTAGT 50 

ACTTGAAGTT CAACAACAAT TAGGTGGTGG TGTAGTTCGC TGTATCGCAA 10 0 

TGGGATCATC TGATGGATTA AAACGCGGTT TAAGCGTAAC AAATACGAAT 15 0 

AACCCAATTT CTGTTC CAGT GGGAACGAAA ACATTGGGTC GTATCATGAA 20 0 

2 5 CGTATTGGGT GAACCAATCG ATGAGCAAGG TGAAATCGGT GCAGAAGAGA 2 50 

ATTGGTCTAT TCACCGTGCG CCACCAAGTT ATGAAGAACA AT CT AACAGT 3 00 

ACTGAACTTT TAGAAACGGG AATTAAAGTT ATCGACTTAG TTTGTCCGTT 35 0 

TGCGAAAGGG GGTAAAGTAG GTTTATTCGG TGGTGCGGGT GTCGGTAAAA 40 0 

CCGTCAATAT GATGGAATTA ATCCGTAACA TCGCAATTGA GCACTCAGGT 4 50 

3 0 TACTCTGTCT TTGCGGGGGT AGGTGAGCGT ACGCGTGAAG GTAACGACTT 5 00 

CTATCATGAG ATGAAAGACT CTAACGTATT AGATAAAGTG TCTCTTGTTT 550 

ATGGTCAAAT GAACGAGCCA CCAGGTAACC GTTTACGTGT GGCATTAACA 60 0 

GGCTTAACTA TGGCGGAAAA ATTCCGTGAT GAAGGTCGTG ATGTCTTATT 650 

CTTCGTTGAT AATATTTATC GTTATACTCT TGCTGGTACA GAAGTTTCTG 700 

35 CATTATTAGG TCGTATGCCA TCTGCGGTAG GTTATCAACC AACCCTTGCA 75 0 

GAAGAAATGG GTGTTCTGCA AGAG CGTATT ACCTCAACCA AAACAGGTTC 80 0 

TATTA 8 05 

40 

2) INFORMATION FOR SEQ ID NO: 3 33 

( i ) SEQUENCE CHARACTERI STI CS : 
(A) LENGTH: 828 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pragia fontium 

(B) STRAIN: ATCC 49100 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333 

TTTCCTCAAG ACGCCGTACC AAAAGTGTAC GACGCGCTTG AAGTTCAGAA 50 

CGATGCCAAG CTGGTGCTGG AAGTTCAACA ACAGCTCGGT GGTGGTGTCG 100 

TTCGTTGTAT CGCAATGGGT ACTTCCGATG GCTTAAGCCG CGGTTTAAAA 150 

60 GTGCTTGATT TAGAACATCC TATCGAAGTA CCGGTTGGTA CTGCGACGCT 200 
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GGGCCGTATT ATGAACGTGC TCGGTCAGCC AATCGATATG AAAGGCGATA 250 

TTGGTGAAGA AGAGCGTTGG GCTATTCACC GTGAAGCACC AAGTTATGAA 3 00 

GATTTATCTG GCGCCAATGA ACTGCTGGAA ACGGGTATCA AGGTTATCGA 3 50 

CCTGATTTGT CCGTTTGCTA AAGGTGGTAA AGTTGGTCTG TTTGGTGGTG 4 00 

5 CGGGTGTAGG TAAAACCGTA AACATGATGG AGCTGATTCG TAACATTGCG 4 50 

ACTGAGCACT CAGGTTACTC CGTATTCGCC GGTGTAGGGG AACGTACCCG 500 

TGAAGGTAAT GACTTCTACC AC GAAATGAC CGAATCAAAC GTACTGGATA 550 

AAGTATCTCT GGTTTATGGC CAGATGAACG AGCCACCAGG AAACCGTCTG 600 

CGCGTGGCGT TAACGGGTTT GACCATGGCT GAAAAATTCC GTGATGAAGG 650 

10 TCGTGACGTT CTGTTATTTA TCGATAACAT TTATCGCTAT ACCTTAGCCG 700 

GTACCGAAGT ATCAGCACTG TTGGGGCGTA TGCCATCAGC GGTAGGTTAT 750 

CAGCCAACGT TAGCAGAAGA GATGGGTGTG TTGCAGGAAC GTATTACTTC 800 

AACCAAAACC GGTTCAATCA CTTCTGTA 828 



15 



2 ) INFORMATION FOR SEQ ID NO: 334 



(i) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 807 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: ATCC 25933 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334 



GTCCCTAAAG TATACGACGC TCTTGAGGTT ATGAATGGTA AAGAAAAACT 50 

GGTGCTGGAA GTTCAGCAAC AGTTAGGCGG TGGTATCGTT CGTTGTATCG 100 

35 CAATGGGTAC ATCAGACGGT TTAAGCCGTG GCTTAAAGGT TGAAGATTTA 150 

GGCCACCCAA TTGAAGTACC AGTAGGTAAA GCGACTTTAG GACGTATCAT 2 00 

GAACGTTCTG GGTACAC CTA TTGATATGAA AGGTGAGATT GAAACCGAAG 250 

AGCGTTGGTC AATCCACCGT GAAGCAC CAA CTTACGAAGA GTTATCAAAC 300 

TCT CAAGAAC TGCTTGAAAC CGGTATCAAA GTTATGGACT TAATCTGTCC 350 

4 0 ATTTGCTAAA GGTGGTAAAG TCGGTCTGTT CGGTGGTGCG GGTGTTGGTA 400 

AAACAGTTAA CATGATGGAA TTGATCCGTA ATATCGCGAT CGAGCACT CA 4 50 

GGTTACTCTG TATTTGCTGG TGTTGGTGAG CGTACTCGTG AGGGTAACGA 500 

CTTCTATCAT GAAATGACAG ATTCTAACGT TCTTGACAAA GTATCGTTAG 5 50 

TTTACGGTCA GATGAATGAG CCAC CAGGAA ACCGTCTGCG TGTTGCACTG 600 

4 5 ACTGGTCTGA CTATGGCTGA GAAATTCCGT GATGAAGGCC GTGACGTACT 650 

GTTATTCGTC GATAACATCT ATCGTTACAC CTTAGCCGGT ACAGAAGTAT 700 

CAGCACTGTT AGGTCGTATG CCATCAGCGG TAGGTTACCA . ACCAACATTG 750 

GCTGAAGAGA TGGGTGTTCT GCAAGAGCGT ATCACTTCAA CCAAAACAGG 800 

TTCTATC 807 

50 



2 ) INFORMATION FOR SEQ ID NO: 335 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 811 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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> • 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus vulgaris 

(B) STRAIN: ATCC 13315 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335 



CCCTAAAGTA 

10 TGCTAGAAGT 
ATGGGTACAT 
CCACCCAATT 
ACGTTCTGGG 
CGTTGGTCTA 

15 TCAAGAACTA 
TTGCTAAAGG 
ACAGTTAACA 
TTATTCTGTA 
TCTATCATGA 

2 0 TATGGTCAGA 
GGGTTTAACC 
TATTCGTCGA 
GCACTGTTAG 
TGAAGAGATG 

2 5 CAATCACCTC 



TACGACGCTC 
TCAGCAACAG 
CAGACGGTTT 
GAAGTACCAG 
TACACCTATC 
TTCACCGCGA 
CTAGAAACCG 
TGGTAAAGTA 
TGATGGAATT 
TTTGCAGGTG 
AATGACAGAT 
TGAATGAGCC 
ATGGCGGAAA 
TAACAT CTAT 
GCCGTATGCC 
GGTGTTCTGC 
T 



TTGAGGTTAT 
TTAGGCGGTG 
AAGCCGTGGC 
TAGGTAAAGC 
GATATGAAAG 
AGCGCCAACC 
GTATCAAAGT 
GGTCTCTTCG 
GATCCGTAAC 
TTGGTGAGCG 
TCTAACGTTC 
ACCAGGAAAC 
AATTCCGTGA 
CGTTACACCT 
ATCAGCAGTA 
AAGAACGTAT 



GAATGGTAAA 
GTATCGTTCG 
TTAAAAGTTG 
AACACTGGGA 
GTGATATTGC 
TATGAAGAGT 
AATGGACTTA 
GTGGTGCGGG 
ATCGCGATTG 
TACTCGTGAG 
TTGACAAAGT 
CGTCTACGTG 
TGAAGGCCGT 
TAGCCGGTAC 
GGTTACCAAC 
CACTTCAACC 



GAGAAACTGG 5 0 

TTGTATCGCA 100 

AAAAC TTAGG 150 

CGTATCATGA 2 00 

AACTGAAGAA 250 

TATCAAGCTC 3 0 0 

ATCTGTCCGT 3 50 

TGTTGGTAAA 4 00 

AGCACTCAGG 4 50 

GGTAACGACT 500 

ATCGTTAGTT 5 50 

TAGCACTGAC 600 

GACGTACTGT 650 

CGAAGTATCA 7 00 

CAACATTGGC 750 

AAAACAGGTT 8 00 
811 
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2) INFORMATION FOR SEQ ID NO: 336 



( i ) SEQUENCE CHARACTER I STI CS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Providencia alcalifaciens 

(B) STRAIN: ATCC 9886 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336 

45 TCAAGATAAC GTACCAAAAG TGTACGATGC TCTTGAGGTT ATTAACGGTA 50 

AAGAAAAACT GGTGTTGGAA GTTCAACAAC AGTTAGGTGG TGGTGTTGTC 100 

CGTTGTATCG CAATGGGTAC ATCAGATGGT CTGAGCCGTG GTTTAGAAGT 150 

TGTAAACTTA GAGCACCCAA TCGAAGTACC AGTCGGTAAA GCAACTCTGG 2 00 

GACGTATCAT GAACGTTCTG GGTGAACCAA TCGACATGAA AGGTGATATC 250 

50 GGCGAAGAAG AGCGCTGGTC TATTCACCGT GCTGCACCAA GCTACGAAGA 300 

ATTAGCTAAC TCAACTGAAC TGCTGGAAAC CGGTATCAAA GTAATGGACT 3 50 

TAATCTGTCC ATTCGCGAAA GGTGGTAAAG TAGGTCTGTT CGGTGGTGCG 4 00 

GGTGTTGGTA AAACCGTAAA CATGATGGAA CTGATCCGTA ACATCGCGAT 450 

TGAGCACTCA GGTTACTCAG TGTTCGCTGG TGTTGGTGAG CGTACCCGTG 50 0 

55 AAGGTAACGA CTTCTATCAT GAAATGACAG ACTCAAACGT TCTGGATAAA 550 

GTATCACTGG TTTATGGCCA GATGAACGAG CCACCAGGAA ACCGTCTGCG 600 

TGTTGCGCTG ACTGGTCTGA CTATGGCTGA AAAATTCCGT GACGAAGGTC 650 

GTGACGTACT GCTGTTCGTT GACAACATTT ATCGTTATAC ACTGGCAGGT 70 0 

ACTGAAGTAT CAGCACTGTT AGGTCGTATG CCATCAGCGG TAGGTTAC CA 7 50 

60 ACCAACGCTG GCGGAAGAGA TGGGTGTTCT TCAAGAACGT ATTACCTCAA 8 00 
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CTCAAA 806 



5 2) INFORMATION FOR SEQ ID NO: 337 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia rettgeri 

(B) STRAIN: ATCC 9250 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337 



TTCCCTCAAG ATGACGTACC AAAAGTGTAC GACGCTCTTG AGGTTGTTAA 50 

CGGTAAAGAA ACACTGGTGC TGGAAGTTCA GCAACAGTTA GGCGGTGGTG 10 0 

TTGTCCGTTG TATCGCAATG GGTACATCAG ATGGCCTGAG CCGTGGTTTA 15 0 

GAAGTTGTAA ACTTAGAGCA CCCAATTGAA GTACCAGTAG GTAAAGCAAC 2 00 

25 TTTAGGACGT ATCATGAACG TTCTGGGTCA GCCTATTGAT ATGAAAGGTG 2 50 

ATATCGGCGA AGAAGAGCGC TGGTCAATTC ACCGTGCTGC ACCTAGCTAC 3 00 

GAAGAGTTAG CTAACTCAAC AGAGCTGCTG GAAACCGGTA TCAAAGTAAT 3 50 

GGACTTAATC TGTCCATTCG CGAAAGGTGG TAAAGTTGGT CTGTTCGGTG 4 00 

GTGCGGGTGT TGGTAAAACA GTAAACATGA TGGAACTGAT CCGTAACATC 4 50 

3 0 GCGATTGAGC ACTCAGGTTA CTCAGTATTC GCTGGTGTTG GTGAGCGTAC 50 0 

TCGTGAAGGG AACGACTTCT AT CATGAAAT GACTGACTCA AACGTTCTGG 550 

ATAAAGTATC ACTGGTTTAT GGCCAGATGA ATGAGCCACC AGGAAACCGT 60 0 

CTGCGCGTTG CGTTGACTGG TCTGACTATG GCTGAAAAAT TCCGTGACGA 650 

AGGTCGTGAC GTACTACTGT TCGTTGACAA CATCTATCGT TATACACTGG 700 

35 CAGGTACTGA AGTATCAGCA CTGTTAGGTC GTATGCCTTC AGCGGTAGGT 750 

TATCAGCCAA CGCTGGCGGA AGAGATGGGT GTTCTGCAAG AACGTATTAC 800 

CTCAACTCAA ACGGGTTCTA TCACTTCCGT 83 0 



40 



50 



2) INFORMATION FOR SEQ ID NO: 338 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 812 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia, ruetigianii 

(B) STRAIN: ATCC 33673 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338 

AGCGTACCAA AAGTGTACGA TGCTCTTGAG GTTATTAACG GTAAAGAAAA 50 

ACTGGTGTTG GAAGTTCAGC AGCAGTTAGG CGGTGGTGTT GTCCGTTGTA 100 

TCGCAATGGG TACATCAGAT GGTCTGAGCC GTGGTTTAGA AGTTGTAAAC 150 

60 TTAGAACAC C CAATTGAAGT AC CAGTAGGT AAAGCAACTC TGGGACGTAT 2 00 
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CATGAACGTT CTGGGTGACC CTATTGATAT GAAAGGTGAT ATCGGCGAAG 2 50 

AAGAGCGCTG GTCTATTCAC CGTTCAGCGC CAAGC TATGA AGAATTAGCT 3 00 

AACTCAACAG AAC TGCTAGA AACCGGTATC AAAGTAATGG ACTTAATCTG 350 

TCCATTCGCG AAAGGTGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTTG 4 00 

5 GTAAAACAGT AAACATGATG GAACTGATCC GTAACATCGC GATTGAGCAC 4 50 

TCAGGTTACT CAGTATTCGC TGGTGTTGGT GAGCGTACCC GTGAAGGTAA 5 00 

CGACTTCTAT CATGAAATGA CTGATTCTAA CGTTCTGGAT AAAGTATCAC 550 

TGGTTTATGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGTGTTGCG 6 00 

CTGACTGGTC TGACTATGGC TGAAAAATTC CGTGACGAAG GTCGTGACGT 650 

10 ACTGCTGTTC GTTGACAACA TTTATCGTTA TACAC TGGCA GGTACTGAAG 700 

TATCAGCACT GTTAGGTCGT ATGCCTTCAG CGGTAGGTTA TCAGCCAACA 750 

TTGGCAGAAG AGATGGGTGT TCTACAAGAA CGTATCACTT CTACCAAAAC 8 00 

CGGTTCTATC AC 812 



15 
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2) INFORMATION FOR SEQ ID NO: 33 9 



( i ) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE : Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia stuartii 

(B) STRAIN: ATCC 33672 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339 



TCAAGATGCA GTACCAAAAG TGTACGATGC GCTTGAGGTT GTTAACGGTA 50 

AAGAAAAACT GGTGCTGGAA GTTCAGCAAC AGTTAGGCGG TGGTGTTGTC 100 

35 CGTTGTATCG CAATGGGTAC AT CAGATGGC CTAAGCCGTG GTTTAGAAGT 150 

TAAAAATTTA GAACACCCAA TTGAAGTACC AGTAGGTAAA GCAACACTCG 2 00 

GACGTATCAT GAACGTTCTG GGTGACCCTA TTGATATGAA AGGTGATATC 2 50 

GGCGAAGAAG AGCGTTGGTC TATTCAC CGC GCTGCACCAA GCTACGAAGA 3 00 

GCTATCGAGC TCAACTGAAC TGCTAGAGAC AGGTATCAAA GTCATGGACT 3 50 

40 TGATCTGTCC ATTCGCGAAA GGTGGTAAAG TTGGTCTGTT CGGTGGTGCG 4 00 

GGTGTTGGTA AAACGGTAAA CATGATGGAA CTTATCCGTA ACATCGCGAT 450 

TGAGCACTCA GGTTACTCAG TATTCGCAGG TGTTGGTGAG CGTACCCGTG 5 00 

AAGGTAACGA CTTCTATCAT GAAATGACAG ATTCAAACGT TCTTGACAAA 550 

GTATCACTGG TTTATGGTCA GATGAATGAG CCACCAGGAA ACCGTCTACG 600 

4 5 CGTAGCATTG ACTGGTTTGA CTATGGCTGA GAAATTCCGT GACGAAGGCC 650 

GTGATGTTCT GTTGTTCGTG GATAACATCT ATCGTTATAC ACTGGCAGGT 700 

ACAGAAGTAT CGGCTCTGTT AGGTCGTATG CCATCAGCAG TAGGTTATCA 750 

GCCAACATTG GCAGAAGAGA TGGGTGTTCT TCAAGAACGT ATCACTTCTA 8 00 

CTAAGACAGG TTCTATCAC 819 

5 0 



2) INFORMATION FOR SEQ ID NO: 340 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 907 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Psychrobacter phenylpyruvicus 

5 (B) STRAIN: ATCC 23333 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340 

AACCGCACTG ACGTGCC T CA AATTTATGAC GCGTTAGTTG TAGATGGTAC 50 

10 CGAAACCACC CTAGAAGTTC AGCAG CAGCT GGGCGATGGT GTGGTACGTA 100 

CTATTGCAAT GGGATCTACT GAAGGTCTTA AGCGTGGTTT ACCAGTAACA 150 

AACACTGGTG CCCCAATTAC AGTTCCAGTA GGTGATGCGA CTTTAGGTCG 200 

TATTATGGAC GTTTTAGGTC GTCCAATTGA CGAACAAGGT CCAGTTAATT 250 

C TGAAGAC CA TTGGTCAATC CACCGTCAAG CGCCATCATA TGATGAGCAA 3 00 

15 GCTAACAGTA CTGACCTATT AGAGACAGGT ATTAAAGTAA TTGACTTACT 350 

TTGTCCGTTC GCTAAAGGGG GTAAAGTTGG TCTGTTCGGT GGTGCCGGTG 4 00 

TTGGTAAAAC CGTAAACATG ATGGAATTGA TTAATAACAT CGCTCTTAAG 4 50 

CACTCAGGTT TATCAGTATT CGCTGGTGTG GGTGAGCGTA CTCGTGAAGG 50 0 

TAACGACTTC TACCACGAGA TGCAAGAAGC GGGTGTTGTT GACGTTGAAA 550 

2 0 ACTTCACCAA CTCAAAAGTT GCGATGGTTT ATGGTCAGAT GAATGAGCCA 600 

CCAGGTAACC GTTTACGTGT TGCGTTAACC GGTCTGACTA TGGCTGAGTA 650 

CTTCCGTGAT CAAAAAGATG AAAACGGTAA AGGTAAAGAC GTTCTATTAT 700 

TCGTTGATAA CATCTACCGC TACACGCTAG CCGGTACTGA AGTATCAGCA 75 0 

CTTCTAGGTC GTATGCCATC AGCAGTAGGT TATCAGCCAA CACTAGCGGA 800 

25 AGAGATGGGT GTACTACAAG AGCGTATTAC TTCAACTCAG ACTGGTTCTA 850 

TTACTTC 90 7 



30 2) INFORMATION FOR SEQ ID NO: 341 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rahnella aquatills 

(B) STRAIN: DSM 45 94 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341 



GTTCCCTCAG GATGCAGTAC CGAACGTGTA CAATGCTCTT GAGGTAGAAA 50 

ACGGTACCTC CAAACTGGTG CTGGAAGTTC AGCAACAGTT AGGCGGCGGC 10 0 

GTTGTTCGTT GTATCGCAAT GGGTACCTCA GACGGCCTGC GTCGCGGTCT 15 0 

GAAAGTGAAC AACCTGGAAC ACCCAATTGA AGTACCGGTA GGTAAAGCGA 2 00 

50 CTCTGGGTCG TATCATGAAC GTATTGGGTG AACCAATCGA CATGAAAGGT 2 50 

GAAATCGGCG AAGAAGAACG TCGTGCTATT CACCGTGCTG CGCCTTCTTA 3 00 

TGAAGAGCTG GCAAACTCCC AGGAATTGCT GGAAACCGGT ATCAAAGTTA 3 50 

TGGACCTGAT GTGTCCGTTC GCTAAGGGCG GTAAAGTTGG TCTGTTCGGT 40 0 

GGTGCGGGTG TAGGTAAAAC TGTGAACATG ATGGAGCTGA TCCGTAACAT 45 0 

55 TGCGATCGAG CACTCCGGTT ATTCTGTGTT TGCAGGCGTG GGTGAACGTA 50 0 

CTCGTGAGGG TAACGACTTC TACCACGAAA TGACTGATTC CAACGTTATC 55 0 

GACAAAGTTT CCCTGGTGTA TGGCCAGATG AATGAGCCAC CAGGTAACCG 60 0 

TCTGCGCGTT GCACTGACCG GC CTGAC CAT GGCGGAAAAA TTCCGTGATG 650 

AAGGTCGTGA CGTACTGCTG TTCGTTGACA AC ATTT AC CG TTACACCCTG 700 

60 GCCGGTACCG AAGTGTCCGC ACTTCTGGGC CGTATG CCAT CGGCGGTAGG 75 0 
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TTATCAGCCA ACGCTGGCGG AAGAGATGGG 
CCTCGACCAA AAGTGGTTCT ATCACCTCCG 



CGCTCTGCAA GAACGTATCA 
TA 



800 
832 



2) INFORMATION FOR SEQ ID NO: 342 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 4 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. ari zonae 

(B) STRAIN: ATCC 13314 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCTCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 10 0 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTT 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCCAC 2 00 

GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAT ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCTATTC ACCGCGCGGC GCCGTCCTAC 3 00 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TAAAAGTTAT 350 

CGACCTGATG TGTC CGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 4 00 

GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 45 0 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 50 0 

TCGTGAGGGT AACGACTTCT AC CACGAAAT GACCGACTCT AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCACC GGGAAACCGT 60 0 

CTGCGCGTTG CATTGAC CGG CCTGACCATG GCGGAAAAAT TCCGTGATGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTCGACAA CATCTACCGT TATAC CCTCG 70 0 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

TAYCASCCGA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACCAAA ACCGGTTCTA TCAC 824 



2) INFORMATION FOR SEQ ID NO: 343 

(i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
s erotype chol eraesui s 

(B) STRAIN: ATCC 7001 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343 

GATGCCGTAC CACGCGTGTA CGATGCCCTT GAGGTGCAGA ATGGTAATGA 50 
GAAGCTGGTG CTGGAAGTTC AGGAGCAGCT TGGCGGCGGT ATCGTGCGTA 100 
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CCATCGCGAT GGGGTCTTCT GACGGTCTGC GTCGCGGTCT GGATGTAAAA 150 

GATCTCGAAC ACCCGATCGA AGTCCCGGTA GGTAAAGCCA CGCTGGGTCG 2 00 

TATCATGAAC GTCCTGGGCG AACCGGTCGA CATGAAAGGC GAGATCGGCG 250 

AAGAAGAGCG TTGGGCGATT CACCGCGCAG CACCTTCCTA CGAAGAGTTG 30 0 

5 TCAAACTCTC AGGAACTGCT GGAAACCGGT ATCAAAGTTA TCGACCTGAT 350 

GTGTCCGTTC GCGAAGGGCG GTAAAGTCGG TCTGTTCGGT GGTGCGGGTG 4 00 

TAGGTAAAAC CGTAAACATG ATGGAGCTTA TTCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ACTCAGTGTT TGCGGGCGTA GGGGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTATC GATAAAGTAT 550 

10 CCCTGGTGTA TGGCCAGATG AACGAGCCGC CGGGAAACCG TCTGCGCGTT 600 

GCACTGACCG GCCTGACCAT GGCGGAGAAA TTCCGTGACG AAGGTCGTGA 65 0 

TGTACTGCTG TTCGTCGATA ACATCTATCG TTACACCCTG GCCGGTACGG 7 00 

AAGTATC CGC ACTGCTGGGC CGTATGCCTT CCGCAGTAGG TTACCAGCCG 750 

ACTCTGGCGG AAGAGATGGG CGTTCTGCAG GAACGTATCA CCTCCACCAA 80 0 

15 AACCGGTTCT ATCACCTCCG 82 0 



2) INFORMATION FOR SEQ ID NO: 344 



20 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

2 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

3 0 (A) ORGANISM : Salmonella choleraesuie subsp. dlarizonae 

(B) STRAIN: ATCC 4 3 973 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344 

35 TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCTCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 10 0 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTT 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC 2 00 

GCTGGGT CGT ATCATGAACG TCCTGGGTGA ACCGGTCGAT ATGAAAGGCG 25 0 

4 0 AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCGGC GCCGTCCTAC 300 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

45 TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCT AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGC CAGATGA ACGAGCCACC GGGAAACCGT 6 00 

CTGCGCGTTG CATTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGATGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTCGACAA CATCTACCGT TATACC CTCG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

50 TATCAGCCGA CTCTGGCTGA AGAAATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACCAAA ACCGGTTCTA TCACCTCCGT A 831 



55 2) INFORMATION FOR SEQ ID NO: 345 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. houtenae 

(B) STRAIN: ATCC 43 974 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCTCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTACGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCTAC 2 00 

GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAT ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCTGC GCCGTCCTAC 3 00 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 3 50 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGT CGGT CTGTTCGGTG 4 00 

GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAAC AT C 4 50 

GCGATCGAGC ACTCCGGTTA CTCCGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGTCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CATTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGACGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG 7 00 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

TATCAGCCGA CGCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 8 00 

CTCCACCAAG ACCGGTTCTA TCACCTCCGT A 831 



2) INFORMATION FOR SEQ ID NO: 346 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. indica 

(B) STRAIN: ATCC 43974 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTCGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 10 0 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGC C AC 2 00 

GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAT ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCGGC GCCGTCCTAT 3 00 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 3 50 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 4 00 

GTGCGGGCGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 4 50 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ACAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAGAAGT TCCGTGACGA 650 

AGGTCGTGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TATACCCTGG 700 
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CCGGTACGGA AGTTTCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 75 0 

TATCAGCCAA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGT AT C AC 8 00 

CTCCACCAAA ACCGGTTCTA TCACCTCCG 829 



2 ) INFORMATION FOR SEQ ID NO: 347 

(i) SEQUENCE CHARACTERISTICS : 
10 (A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype paratyphi A 
20 (B) STRAIN: ATCC 9150 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347 

GCCGTACCAC GCGTGTACGA TGCCCTTGAG GTGCAGAATG GTAATGAGAA 50 

25 GCTGGTGCTG GAAGTTCAGC AGCAGCTTGG CGGCGGTATC GTGCGTACCA 100 

TCGCGATGGG GTCTTCTGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAT 15 0 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCTACGC TGGGTCGTAT 2 00 

CATGAACGTC CTGGGCGAAC CGGTCGACAT GAAAGGCGAG ATCGGCGAAG 250 

AAGAGCGTTG GGCGATTCAC CGCGCAGCGC CTTCCTACGA AGAGTTGTCA 3 00 

3 0 AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACCGT AAACATGATG GAGCTTATCC GTAACATCGC GATCGAGCAC 450 
TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA * 500 

CGACTTCTAC CATGAAATGA CCGACTCCAA CGTTAT CGAT AAAGTATCCC 550 

3 5 TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 600 

CTGACCGGCC TGACCATGGC GGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 70 0 

TATCCGCACT GCTGGGTCGT ATGCCTTCCG CGGTAGGTTA CCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAGAC 800 

40 CGGTTCTATC ACCTCCG 817 



45 



60 



2) INFORMATION FOR SEQ ID NO: 348 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 

serotype paratyphi B 
(B) STRAIN: ATCC 8759 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348 
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GCCGTACCAC GCGTGTACGA TGCCCTTGAG GTGCAGAATG GTAATGAGAA 5 0 

GCTGGTGCTG GAAGTTCAGC AGCAGCTTGG CGGCGGTATT GTGCGTACCA 10 0 

TCGCGATGGG GTCTTCTGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAT 15 0 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCCACGC TGGGTCGTAT 2 00 

5 CATGAACGTC CTGGGCGAAC CGGTCGACAT GAAAGGCGAG ATCGGCGAAG 2 50 

AAGAGCGTTG GGCGATTCAC CGCGCAGCGC CTTCCTACGA AGAGTTGTCA 3 00 

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 3 50 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTGG 40 0 

GTAAAACCGT AAACATGATG GAGCTTATCC GTAACATCGC GATCGAGCAC 4 50 

10 TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTAT CGAT AAAGT ATCC C 550 

TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 60 0 

TTGACCGGTC TGACCATGGC GGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 70 0 

15 TATCCGCACT GCTGGGCCGT ATGCCTTCCG CAGTAGGTTA CCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAAAC 8 00 

CGGTTC 806 



20 



30 



55 



2) INFORMATION FOR SEQ ID NO: 34 9 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 831 bases 
25 (B) TYPE: Nucleic acid 

(C) STRAND EDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella, choleraesuis subsp. salamae 

(B) STRAIN: ATCC 4 3 972 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGKGGTCTG 150 

4 0 GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC 2 00 

GCTGGGCCGT ATCATGAACG TACTGGGCGA ACCGGTCGAC ATGAAAGGCG 250 

AGATTGGTGA AGAAGAGCGT TGGGCTATTC ACCGTGCTGC GCCGTCCTAC 3 00 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGAC CTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

45 GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTCAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCCGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 55 0 

ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAAC CGT 60 0 

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGACGA 65 0 

5 0 AGGTCGTGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TATACCCTGG 70 0 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 75 0 

TATCAGCCGA CGCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 80 0 

CTCCACTAAA AC CGGTTCTA TCACCTCCGT A 831 



2) INFORMATION FOR SEQ ID NO: 350 



( i ) SEQUENCE CHARACTERISTICS : 
60 (A) LENGTH: 823 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 

s e r o t yp e typhi 

(B) STRAIN: ATCC 1074 9 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTC CCGGTAG GTAAAGCTAC 2 00 

GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAC ATGAAAGGCG 25 0 

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCAGC GCCTTCCTAC 3 00 

GAAGAGTTAT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 3 50 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 4 00 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCATGAAAT GACCGACTCC AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGC CAGATGA ACGAGCCGCC GGGAAAC CGT 600 

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAGAAAT TCCGTGACGA 650 

AGGT CGTGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGC GGTAGGT 750 

TACCAGCCGA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 8 00 

CTCCACCAAG ACCGGTTCTA TCA 823 



2) INFORMATION FOR SEQ ID NO : 3 51 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype typhimuri urn 

(B) STRAIN: ATCC 14 02 8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TTGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCTAC 2 00 

GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAC ATGAAAGGCG 2 50 

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCAGC GCCTTCCTAC 300 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 3 50 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 40 0 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCAGTGTTT GCGGGCGTAG GGGAACGTAC 500 
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TCGTGAGGGT AACGACTTCT 
ATAAAGTATC CCTGGTGTAT 
CTGCGCGTTG CATTGACCGG 
AGGT CGTGAC GTACTGCTGT 
5 CCGGTACGGA AGTATCCGCA 
TACCAGCCGA CTCTGGCGGA 
CTCCACCAAA ACCGGTTCTA 



ACCACGAAAT GACCGACTCC 
GGCCAGATGA ACGAGCCGCC 
TCTGACCATG GCGGAGAAAT 
TCGTCGATAA CATCTATCGT 
CTGCTGGGCC GTATGCCTTC 
AGAGATGGGC GTTCTGCAGG 
TCACCTCCGT A 



AACGTTATCG 550 

GGGAAACCGT 600 

TCCGTGACGA 650 

TACACCCTGG 700 

CGCAGTAGGT 750 

AACGTATCAC 8 00 
831 



10 



20 



25 



2) INFORMATION FOR SEQ ID NO: 352 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 810 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype virchow 

(B) STRAIN: ATCC 51955 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352 



GCCGTACCAC GCGTGTACGA TGCCCTTGAG GTGCAGAATG GTAATGAGAA 50 

GCTGGTGCTG GAAGTT CAGC AGCAGCTTGG CGGCGGTATC GTGCGTACCA 100 

3 0 TCGCGATGGG GTCTTCTGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAT 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCCACGC TGGGTCGTAT 2 00 

CATGAACGTC CTGGGCGAAC CGGTCGACAT GAAAGGCGAG ATCGGCGAAG 250 

AAGAGCGTTG GGCGATTCAC CGCGCAGCAC CTTCCTACGA AGAGTTGTCA 3 00 

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 3 50 

3 5 TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACCGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGG GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAT AAAGTCTCCC 550 

TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 600 

4 0 TTGACCGGTC TGACCATGGC GGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTG CTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGCCGT ATGCCTTCCG CAGTAGGTTA CCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAAAC 800 
CGGTTCTATC 



45 



810 



2) INFORMATION FOR SEQ ID NO: 353 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Serratia ficaria 
60 (B) STRAIN: ATCC 33105 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353 



GATGCCGTAC 
5 GAAACTGGTG 
GTATCGCAAT 
AACCTGGAAC 
TATCATGAAC 
AAGAAGAGCG 

10 TCCAACTCCC 
TTGTCCGTTC 
TGGGCAAAAC 
CACTCCGGTT 
TAACGACTTC 

15 CCCTGGTTTA 
GCATTGACCG 
CGTTCTGCTG 
AAGTGTCCGC 
ACGCTGGCGG 

2 0 GACCGGTTCC 



CAAAAGTGTA 
CTGGAAGTTC 
GGGGACCTCT 
AC CCGATTGA 
GTATTGGGCG 
TTGGGCGATT 
AGGACCTGCT 
GCCAAGGGCG 
CGTAAACATG 
ATTCCGTGTT 
TACCACGAAA 
CGGCCAGATG 
GCCTGACCAT 
TTCGTTGACA 
ACTTCTGGGC 
AAGAGATGGG 
ATCACCTCCG 



CGATGCCCTT 
AGCAGCAGCT 
GACGGTCTGC 
AGTGCCGGTG 
AACCAATCGA 
CACCGTCCTG 
GGAAACCGGT 
GTAAAGTCGG 
ATGGAGCTGA 
TGCGGGCGTG 
TGAACGACTC 
AACGAGCCGC 
GGCGGAGAAA 
ACATTTACCG 
CGTATGCCAT 
CGTTCTGCAA 



GAGGTAGAAA 
GGGCGGTGGC 
GTCGCGGTCT 
GGTAAAGCTA 
CATGAAAGGC 
CGCCAAGCTA 
ATCAAGGTAA 
TCTGTTCGGT 
TCCGTAACAT 
GGCGAGCGTA 
CAACGTTCTG 
CGGGTAACCG 
TTCCGTGACG 
TTACAC C CTG 
CCGCGGTAGG 
GAACGTATCA 



ACGGTACCGA 50 

GTCGTTCGCT 100 

GAAAGTGAAC 150 

CCCTGGGCCG 2 00 

GAGATCGGCG 250 

CGAAGAGCTG 300 

TGGACCTGAT 350 

GGTGCGGGCG 400 

CGCGATCGAG 4 50 

CTCGTGAGGG 500 

GACAAAGTAT 55 0 

TCTGCGCGTT 600 

AAGGCCGCGA 650 

GCCGGTACCG 700 

TTATCAGCCA 75 0 

CCTCGACCAA 80 0 
820 



25 



30 



35 



2) INFORMATION FOR SEQ ID NO: 354 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia. fonticola 

(B) STRAIN: ATCC 29844 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354 



4 0 GATGCCGTAC 
GAAATTGGTG 
GTATCGCAAT 
GACCTGCAGC 
TATCATGAAC 

4 5 AAGAAGAACG 
TCCAGCTCCC 
CTGCCCGTTC 
TAGGTAAAAC 
CACTCCGGTT 

50 TAACGACTTC 
CCCTGGTTTA 
GCGCTGACCG 
CGTACTGCTG 
AAGTGTCCGC 

55 ACGCTGGCGG 
GACTGGTTCA 



CGAAAGTGTA 
CTGGAAGTTC 
GGGGACCTCA 
ACCCAATTGA 
GTATTGGGTG 
TTGGGCTATT 
AGGATC TGCT 
GCCAAGGGTG 
CGTAAACATG 
ATTCTGTGTT 
TACCACGAAA 
CGGCCAGATG 
GCCTGACCAT 
TTCGTCGATA 
ACTTCTGGGC 
AAGAGATGGG 
ATCACC 



CAACGCTCTT 
AGCAACAGCT 
GACGGTCTGC 
AGTAC CGGTA 
AACCAATCGA 
CACCGCCCTG 
GGAAACCGGT 
GTAAAGTTGG 
ATGGAGCTGA 
TGCGGGCGTG 
TGACCGATTC 
AACGAGCCAC 
GGCTGAGAAG 
ACATCTACCG 
CGTATGCCAT 
TGTTCTGCAA 



GAGGTTGCAA 
GGGTGGCGGC 
GTCGTGGTCT 
GGTAAAGCTA 
CATGAAGGGC 
CGCCAAGCTA 
ATCAAGGTAA 
TCTGTTCGGT 
TCCGTAACAT 
GGTGAACGTA 
CAACGTACTG 
CAGGTAACCG 
TTCCGTGACG 
TTATACCCTG 
CGGCGGTAGG 
GAACGTATCA 



ACGGCACCGA 50 

GTGGTT CGCT 100 

GGCCGTAACC 150 

CCCTGGGCCG 2 00 

GACATCGGCG 250 

CGAAGAGCTG 3 00 

TGGACCTGAT 350 

GGTGCTGGTG 4 00 

CGCGATCGAG 4 50 

CTCGTGAGGG 500 

GACAAAGTTT 550 

TCTGCGCGTT 600 

AAGGTCGTGA 650 

GCCGGTACCG 700 

TTATCAGCCA 750 

CCTCTACCAA 800 
816 



60 2) INFORMATION FOR SEQ ID NO: 355 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia grimesii 

(B) STRAIN: ATCC 14460 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355 

GATGCCGTAC CAAAAGTGTA CAACGCTCTT GAGGTAGAAA ACGGTGCCAA 
TAAGCTGGTG CTGGAAGTTC AGCAACAGTT GGGCGGTGGC GTGGTTCGCT 
GTATCGCAAT GGGGACCTCT GATGGTCTGC GTCGCGGTCT GAAAGT CACA 
GACCTGGACC AC CCAATTGA AGTAC CGGTA GGTAAAGCTA CTCTGGGCCG 
TATCATGAAC GTATTGGGTG AAC CAATCGA CATGAAGGGC GATATCGGCG 
AAGAAGAACG TTGGGCGATT CACCGTCCGG CGCCAAGCTA CGAAGATTTG 
GCCAACTCCC AGGATCTGCT GGAAACCGGT ATCAAGGTAA TGGACCTGAT 
CTGCCCGTTC GCCAAGGGTG GTAAAGTCGG TCTGTTCGGT GGTGCGGGTG 
TTGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 
CACTCCGGTT ATTCTGTGTT TGCGGGCGTG GGTGAGCGTA CTCGTGAGGG 
TAACGACTTC TACCACGAAA TGAACGACTC CAACGTACTG GACAAAGTAT 
CCCTGGTTTA CGGCCAGATG AACGAGCCAC CGGGTAACCG TCTGCGCGTT 
GCTCTGACCG GTCTGACCAT GGCTGAGAAA TTCCGTGACG AAGGCCGTGA 
CGTTCTGCTG TTCGTTGATA ACATCTACCG TTATACCCTG GCCGGTACCG 
AAGTGTCCGC ACTTCTGGGC CGTATGCCAT CGGCGGTAGG TTATCAGCCA 
ACGCTGGCGG AAGAGATGGG TGTTCTGCAA GAACGTATCA CCTCTACCAA 
GACTGGTTCA ATCACCTCCG TA 



2) INFORMATION FOR SEQ ID NO: 3 56 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia liquefaciens 

(B) STRAIN: ATCC 2 75 92 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356 

ATGCCGTACC AAAAGTGTAC AATGCTCTTG AGGTAGAAAA CGGTACCGAG 
AAGCTGGTGC TGGAAGTT CA GCAGCAGTTG GGCGGTGGCG TAGTTCGCTG 
TATCGCGATG GGGAC C TCAG ATGGTCTGCG CCGCGGTCTG AAAGTGAACG 
ATCTGGAACA CCCAATTGAA GTACCGGTAG GTAAAGC TAC CCTGGGCCGT 
ATCATGAACG TATTGGGTGA ACCAATCGAC ATGAAAGGCG ACAT CGGCGA 
AGAAGAACGT TGGGCGATTC ACCGTCCAGC GCCAAGCTAC GAAGATTTGT 
CAAACTCCCA GGATCTGCTG GAAACCGGTA TCAAGGTTAT GGAC CTGATT 
TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCTGGTGT 
TGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATC GAGC 



344 



CA 02283458 1999-09-28 



10 
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ACTCCGGTTA TTCCGTGTTT GCAGGCGTGG GTGAGCGTAC TCGTGAGGGT 500 

AACGACTTCT ACCACGAAAT GAACGACTCC AACGTACTGG ACAAAGTATC 550 

CCTGGTTTAC GGCCAGATGA ACGAGCCACC GGGTAACCGT CTGCGCGTTG 600 

CTCTGACCGG TCTGACCATG GCGGAGAAAT TCCGTGACGA AGGCCGCGAC 650 

GTTCTGCTGT TCGTTGATAA CATTTACCGT TATACCCTGG CCGGTACCGA 700 

AGTGTCCGCA CTTCTGGGCC GTATGCCATC TGCGGTAGGT TATCAGCCAA 750. 

CGCTGGCGGA AGAGATGGGC GTTCTGCAAG AACGTATCAC CTCTACCAAG 800 

ACCGGTTCTA TCACTTCCG 819 



2) INFORMATION FOR SEQ ID NO: 357 



(i) SEQUENCE CHARACTERISTICS : 
15 (A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

2 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia marcescens 

(B) STRAIN: ATCC 13880 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357 



TCAGGATGCC GTACCGAAAG TGTACGACGC CCTTGAGGTA GAAAACGGCA 50 

CCGAAAAACT GGTGTTGGAA GTTCAGCAAC AGCTGGGCGG TGGCGTGGTT 100 

3 0 CGCTGTATCG CAATGGGGAC CTCCGACGGT CTGCGTCGCG GTCTGAAAGT 150 

GAACAAC CTG GACCACCCGA TTGAAGTGCC GGTGGGTAAA GCTACCCTGG 2 00 

GTCGTATCAT GAACGTATTG GGTCAACCGA TCGACATGAA AGGCGACATC 2 50 

GGCGAAGAAG AGCGTTGGGC GATTCACCGC GCGGCGCCAA GCTACGAAGA 3 00 

GCTGTCAAGC TCTCAGGAAC TGCTGGAAAC CGGTATCAAG GTAATGGACC 3 50 

3 5 TGATTTGTCC GTTCGCCAAG GGCGGTAAAG TCGGTCTGTT CGGCGGTGCG 4 00 

GGCGTAGGTA AAACCGTAAA CATGATGGAG CTGATCCGTA ACATCGCGAT 4 50 

CGAGCACTCC GGTTATTCCG TGTTTGCGGG CGTGGGCGAG CGTACTCGTG 5 00 

AGGGTAACGA CTTCTACCAC GAAATGACCG ACTCCAACGT TCTGGACAAA 550 

GTATCCCTGG TTTACGGCCA GATGAACGAG CCACCAGGTA ACCGTCTGCG 600 

4 0 CGTTGCGCTG ACCGGTCTGA CCATGGCGGA GAAATTCCGT GACGAAGGCC 650 

GTGACGTTCT GCTGTTCGTT GACAACATCT ACCGTTACAC CCTGGCCGGT 700 

ACCGAAGTGT CCGCACTTCT GGGCCGTATG CCATCCGCGG TAGGTTATCA 75 0 

GCCAACGCTG GCGGAAGAGA TGGGCGTTCT GCAAGAACGT ATCACCTCGA 800 

CCAAG 805 

45 

2 ) INFORMATION FOR SEQ ID NO : 3 58 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia odorifera 
60 (B) STRAIN: ATCC 33077 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358 

GATGCCGTAC CAAAAGTGTA CGATGCCCTT GAGGTAGAAA ACGGTACCGA 5 0 

5 AACTCTGGTG CTGGAAGTTC AGCAGCAGCT GGGCGGTGGC GTGGTT CGTT 100 

GTATCGCTAT GGGCACCTCC GACGGTTTGC GTCGCGGCCT GAAAGTGAAC 150 

GATCTGCAAC ACCCAATCGA AGTCCCGGTT GGCAAGGCAA CGCTGGGTCG 200 

TATCATGAAC GTATTGGGTC AACCAATCGA CATGAAAGGC GACATCGGCG 250 

AAGAAGAGCG TTGGGCGATT CACCGCGCGG CGCCAAGCTA CGAAGAACTG 3 00 

10 TCCAACTCCC AGGATCTGCT GGAAACCGGT ATCAAGGTTA TGGAC CTGAT 350 

CTGCCCGTTT GCCAAGGGTG GTAAAGT CGG TCTGTTCGGT GGTGCGGGTG 4 00 

TTGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGAT CGAG 450 

CACTCCGGTT ATTCAGTGTT TGCGGGCGTG GGTGAGCGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGACTC CAACGTACTG GACAAGGTTT 550 

15 CCCTGGTTTA CGGCCAGATG AACGAGC CAC CGGGTAACCG TCTGCGCGTT 600 

GCGCTGACCG GTCTGACCAT GGC CGAGAAA TTCCGTGACG AAGGTCGTGA 6 50 

CGTTCTGCTG TTCGTTGACA ACATTTACCG TTACACCCTG GCCGGTACCG 700 

AAGTGTCTGC ACTTCTGGGC CGTATGCCAT CGGCGGTAGG TTATC AG C CA 75 0 

ACGCTGGCGG AAGAGATGGG CGTTCTGCAA GAACGTATCA CCTCGACCAA 8 00 

2 0 GAC CGGTTCT ATCACCTCCG TA 82 2 



2) INFORMATION FOR SEQ ID NO: 359 



25 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8 05 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Serratia plymuthlca 

(B) STRAIN: ATCC 183 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359 

4 0 GTGTACAACG CTCTTGAGGT AGAAAACGGT GCCAATAAGC TGGTGCTGGA 50 

AGTTCAGCAA CAGCTGGGCG GTGGCGTGGT TCGCTGTATC GCGATGGGGA 100 

CCTCTGATGG TCTGCGTCGC GGTCTGAAAG TGATCGACCT GGATCACCCG 150 

ATTGAAGTAC CGGTAGGTAA AGCTACCCTG GGC CGTATCA TGAACGTATT 2 00 

GGGTGAACCA AT CGAC ATGA AAGGCGACAT CGGCGAAGAA GAACGTTGGG 250 

45 CAATTCACCG TCCAGCGCCA AGCTACGAAG ATTTGGCCAA CTCCCAGGAT 300 

CTGCTGGAAA CCGGTATCAA GGTTATGGAC CTGATCTGTC CGTTCGCTAA 350 

GGGCGGTAAA GTGGGTCTGT TCGGCGGTGC GGGCGTGGGT AAAACCGTAA 40 0 

ACATGATGGA GCTGATCCGT AACATCGCGA TCGAACACTC CGGTTATTCC 450 

GTGTTTGCGG GCGTGGGTGA GCGTACTCGT GAGGGTAACG ACTTCTACCA 50 0 

50 CGAAATGAAC GACTC CAACG TACTGGACAA AGTATCCCTG GTTTACGGCC 550 

AGATGAACGA GCCACCGGGT AACCGTCTGC GCGTTGCTCT GACCGGTCTG 600 

AC CATGGCGG AGAAATTCCG TGACGAAGGC CGCGACGTTC TGCTGTTCGT 65 0 

TGATAACATC TACCGTTATA CCCTGGCCGG TACCGAAGTG TCCGCACTTC 70 0 

TGGGCCGTAT GCCATCTGCG GTAGGTTATC AGCCAACGCT GGCGGAAGAG 75 0 

55 ATGGGCGTTC TGCAAGAACG TATCAC CTCT ACCAAGACCG GTTCTATCAC 8 00 

CTCCG 80 5 



60 2) INFORMATION FOR SEQ ID NO: 360 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia rubidaea 

(B) STRAIN: ATCC 27593 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360 



TTCCCTCAGG ATGCCGTACC AAAAGTGTAC GATGCCCTTG AGGTAGAGAA 50 

CGGTAACGAA AAACTGGTGC TGGAAGTTCA GCAGCAGCTG GGCGGCGGCG 100 

TTGTACGCTG TATCGCCATG GGTACTTCCG ACGGTCTGCG TCGCGGTCTG 150 

AAAGTTAACG ACCTCGAGCA CCCAATCGAA GTGCCAGTTG GTAAAGCAAC 200 

2 0 GCTGGGTCGT ATCATGAACG TATTGGGTCA GCCAATCGAC ATGAAAGGCG 250 

ACATCGGCGA AGAAGAGCGT TGGGC GATTC ACCGCGCGGC GC CAAGCTAC 3 00 

GAAGAGC TGT CCAGCTCCCA AGAGCTGCTG GAAACCGGTA TCAAGGTAAT 3 50 

GGACCTGATC TGCCCGTTCG CCAAGGGTGG TAAAGTTGGT CTGTTCGGTG 400 

GTGCGGGCGT AGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC 450 

2 5 GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTGG GTGAGCGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTACTGG 550 

ACAAAGTATC CCTGGTTTAC GGCCAGATGA ACGAGCCGCC GGGTAACCGT 60 0 

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGATGA 650 

AGGCCGCGAC GTTCTGCTGT TCGTGGATAA CATCTACCGT TACACCCTGG 700 

3 0 C CGGTAC CGA AGTGTCCGCA CTGCTCGGCC GTATGCCATC TGCGGTAGGT 750 

TATCAGC CAA CGCTGGCGGA AGAGATGGGC GTTCTGCAAG AACGTATCAC 8 00 

CTCGACCAAG ACCGGTTCAA TCACCTCCGT A 831 



35 



45 



2 ) INFORMATION FOR SEQ ID NO : 361 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 06 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shewanella putida 

(B) STRAIN: LCDC D717 9 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361 

CCGTAAACAT GATGGAACTG ATC CGTAACA TCGCCATCGA GCACAGCGGT 5 0 

TATTCCGTGT TCGCCGGTGT GGGTGAGCGT ACTCGTGAGG GTAACGACTT 100 

CTACCACGAG ATGAAGGACT CCAACGTTCT CGACAAGGTA GCGCTGGTCT 15 0 

55 ACGGTCAGAT GAACGAGC CA CCAGGAAACC GTCTGCGCGT AGCGCTGACC 2 00 

GGCCTGACCA TGGCCGAGAA GTTCCGTGAC GAAGGTAACG ACGTTCTGCT 25 0 

GTTCGTCGAC AACATCTATC GTTACACCCT GGC CGGTAC C GAAGTATCCG 3 00 

CACTGCTGGG CCGTATGCCT TCGGCGGTAG GTTACCAGCC GACCCTGGCT 3 50 

GAAGAGATGG GCGTTCTGCA AGAACGTATC ACTTCGACCA AGGAAGGTTC 4 00 

60 GATCAC 4 06 
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2) INFORMATION FOR SEQ ID NO: 362 



5 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Shigella boydii 

(B) STRAIN: ATCC 92 07 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362 

2 0 TTCCCTCAGG ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTGCAAAA 50 

TGGTAATGAG CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA 100 

TCGTGCGTAC CATCGCAATG GGTTCCTCCG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ACCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCGAC 200 

TCTGGGCCGT AT C ATGAACG TACTGGGTGA ACCGGTCGAC ATGAAAGGCG 250 

2 5 AGATCGGTGA AGAAGAGCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAC 3 00 

GAAGAGCTGT CAAACTCTCA GGAAC TGCTG GAAACCGGTA TCAAAGTTAT 3 50 

CGAC CTGATG TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG 4 00 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTCAT TCGTAACATC 4 50 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

3 0 TCGTGAGGGT AACGACTTCT AC C ACGAAAT GACCGACTCC AACGTTATCG 550 

ACAAAGTATC CCTGGTGTAT GGC CAGATGA ACGAGCCGCC GGGAAACCGT 60 0 

CTGCGCGTTG CTCTGACCGG TCTGACCATG GCTGAGAAAT TCCGTGACGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTTGACAA CATCTATCGT TACACCCTGG 70 0 

CCGGTACGGA AGTATCCGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT 750 

3 5 TATCAGCCGA CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 8 00 

CTCCACCAAA ACTGGTTCTA TCACCTCCGT A 831 



40 2) INFORMATION FOR SEQ ID NO: 363 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 02 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella dysenteriae 

(B) STRAIN: ATCC 11835 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363 



GCCGTACCGC GCGTGTACGA TGCTCTTGCG GTGCAAAATG GTAATGAGCG 50 

TCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTGCGTACCA 100 

TCGCAATGGG TTCCTCCGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAC 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCGACTC TGGGCCGTAT 2 00 

60 CATGAACGTA CTGGGTGAAC CGGTCGACAT GAAAGGCGAG ATCGGTGAAG 250 
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i 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella sonnei 

(B) STRAIN: ATCC 29930 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365 

GTACCGCGCG TGTACGATGC TCTTGAGGTG CAAAATGGTA ATGAGCGTCT 50 

GGTGCTGGAA GTTCAGCAGC AGCTCGGCGG CGGTATCGTG CGTACCATCG 10 0 

10 CAATGGGTTC CTCCGACGGT CTGCGTCGCG GTCTGGATGT AAAAGACCTC 150 

GAACACCCGA TCGAAGTCCC GGTAGGTAAA GCGACTCTGG GCCGTATCAT 200 

GAACGTACTG GGTGAACCGG TCGACATGAA AGGCGAGATC GGTGAAGAAG 2 50 

AGCGTTGGGC GATTCACCGC GCAGCACCTT CCTACGAAGA GCTGTCAAAC 3 00 

TCTCAGGAAC TGCTGGAAAC CGGTATCAAA GTTATCGACC TGATGTGTCC 35 0 

15 GTTCGCTAAG GGCGGTAAAG TTGGTCTGTT CGGTGGTGCG GGTGTAGGTA 400 

AAACCGTAAA CATGATGGAG CTCATTCGTA ACATCGCGAT CGAGCACTCC 4 50 

GGTTACTCTG TGTTTGCGGG CGTAGGTGAA CGTACTCGTG AGGGTAACGA 500 

CTTCTACCAC GAAATGACCG ACTCCAACGT TATCGACAAA GTATCCCTGG 55 0 

TGTATGGCCA GATGAACGAG CCGCCGGGAA ACCGTCTGCG CGTTGCTCTG 60 0 

2 0 ACCGGTCTGA CCATGGCTGA GAAATTCCGT GACGAAGGTC GTGACGTTCT 650 

GCTGTTCGTT GACAACATCT ATCGTTACAC CCTGGCCGGT ACGGAAGTAT 700 

CCGCACTGCT GGGCCGTATG CCTTCAGCGG TAGGTTATCA GCCGACCCTG 750 

GCGGAAGAGA TGGGCGTTCT GCAGGAACGT ATCACCTCCA CCAAAACTGG 800 

TT 802 

25 



2) INFORMATION FOR SEQ ID NO: 366 

3 0 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 785 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366 

TAACGCCTTG GTTATTGATG TGCCTAAAGA AGAAGGTACA ATACAACTAA 50 

CATTAGAAGT TGCGCTGCAA TTAGGTGACG ACGTTGTTCG TACAATTGCG 100 

45 ATGGATTCAA CTGATGGTGT CCAAAGAGGC ATGGATGTAA AAGATACAGG 150 

CAAAGAAATT AGTGTACCTG TTGGTGATGA AACATTAGGT CGTGTATTTA 2 00 

ATGTACTAGG TGAAACAATT GAC CTTAAAG AAGAAATTAG TGATTCTGTT 2 50 

CGCCGCGATC CTATCCATCG T CAAGCAC C A GCATTCGATG AACTTTCAAC 3 00 

AGAAGTTCAA ATTTTAGAAA CAGGTATTAA AGTAGTAGAT TTACTAGCAC 350 

50 CTTATAT TAA AGGTGGTAAA ATCGGATTGT TCGGTGGTGC CGGTGTAGGT . 4 00 

AAAACAGTAT TAATCCAAGA ATTAATTAAC AACATCGCTC AAGAGCACGG 450 

TGGTATTTCT GTATTCGCCG GTGTAGGTGA ACGTACTCGT GAAGGTAACG 500 

ATTTATACTT CGAAATGAGT GATAGTGGTG TAATTAAGAA AACAGC CATG 550 

GTATTCGGGC AAATGAATGA GCCACCTGGT GCACGTATGC GTGTTGCATT 600 

55 ATCTGGTTTA ACAATGGCTG AATATTTCCG TGACGAACAA GGTCAAGACG 650 

TATTATTATT CATCGATAAC ATTTTCAGAT TTACACAAGC TGGTTCTGAG 7 00 

GTATCTGCAT TATTAGGTCG TATGCCTTCT GCAGTAGGTT ACCAACCAAC 750 

ACTTGCTACT GAAATGGGAC AATTACAAGA ACGTA 785 

60 
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2) INFORMATION FOR SEQ ID NO: 3 67 

(i) SEQUENCE CHARACTERISTICS : 
5 (A) LENGTH: 84 3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus auricularis 

(B) STRAIN: ATCC 33753 



15 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367 



GAACATAACG AAGTGCCTAA TATTAATAAC GC CTTAGTAC TCGATGTTGA 50 

AAGAGAAGAC GGAACAGTGT CTTTAACTTT AGAAGTAGCT TTACAATTAG 100 

2 0 GCGATGACGT TGTTCGTACC ATTGCAATGG ATTCAACTGA TGGTGTTAAA 15 0 

CGTGGTAACG AAGTCAAAGA TACTGGTAAT AGCATTAGCG TACCAGTCGG 2 00 

AGACGAAACT TTAGGACGTG TCTTCAACGT T C T AGGTGAA ACAATTGATT 25 0 

TAGAAGATAA ACTTGATGAT TCTGCGCGAC GTGACCCTAT ACATAGAGAA 3 00 

GCGCCAGCGT TTGATCAATT ATCAACTCAA GTTGAAATTT TAGAAACAGG .35 0 

25 AATTAAAGTT GTTGACTTAT TAGCACCTTA TATTAAAGGT GGTAAAGTTG 400 

GACTCTTCGG TGGTGCCGGT GTTGGTAAAA CCGTTTTAAT CCAAGAATTA 450 

ATCAACAACA TCGCTCAAGA ACACGGTGGT ATTTCAGTCT TTGCCGGTGT 500 

AGGTGAACGT ACACGTGAAG GTAACGACTT GTACTATGAA ATGAGCGACA 550 

GTGGTGTAAT CAAGAAAACA GCCATGGTCT TCGGACAAAT GAACGAAC CA 60 0 

3 0 CCTGGCGCAC GTATGCGTGT TGCTTTATCT GGTTTAACAA TGGCTGAATA 650 

TTTCCGTGAT GAACAAGGAC AAGACGTATT GTTATTCATC GACAATATTT 700 

TCCGTTTCAC ACAAGCCGGT TCAGAAGTTT CTGCCTTACT AGGTCGTTTA 750 

CCATCAGCCG TTGGTTATCA ACCTACATTA GCAACAGAAA TGGGACAATT 8 00 

ACAAGAACGT ATTACTTCAA CAACAAAAGG ATCAGTTACT TCA 84 3 

35 

2) INFORMATION FOR SEQ ID NO : 368 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus capitis subsp. capitis 
50 (B) STRAIN: ATCC 27840 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368 

GCTTTGAACA TAATGAAGTT CCTGATATTA ACAATGCCTT ACACATCGAA 50 

55 GTTCCTAAAG AAGATAGCAC ACTTCATTTA ACTTTAGAAG TTGCACTTCA 100 

ATTAGGTGAC GATGTAGTAC GTACAATCGC AATGGACTCA ACTGACGGCG 15 0 

TTCAAAGAGG TATGGAAGTT AAAGATACAG GTAAAGATAT TAGCGTACCT 2 00 

GTTGGTGATG CAACTTTAGG AAGAGTATTT AACGTATTAG GAGAAACAAT 250 

CGATTTAGAT GAAAAGATTG ATGATTCAGT ACGTCGTGAT CCTATTCATA 3 00 

60 GACAGGC AC C TGGCTTCGAT GAATTATCTA CTAAAGTAGA AAT CTTAGAA 3 50 
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ACAGGTATCA AAGTAGTAGA CTTATTAGCA 
AATTGGATTA TTCGGTGGTG CCGGTGTTGG 
AACTTATCAA TAATATCGCT CAAGAGCATG 
GGTGTTGGTG AACGTACACG TGAAGGTAAC 
5 CGATAGTGGT GTAATTAAGA AAAC AG CGAT 
AGCCACCTGG TGCTCGTATG CGTGTTGCAT 
GAATATTTCC GTGATGAAGA AGGC CAAGAC 
TATCTTCAGA TTCACACAAG CTGGTTCTGA 
GTATGC CTTC AGCCGTTGGT TATCAACCAA 
10 CAATTACAAG AACGTATTAG TTCAACTAAT 



CCTTACATTA AAGGTGGTAA 400 

TAAGACAGTT TTAATCCAAG 450 

GTGGTATTTC AGTATTCGCC 5 00 

GACCTTTACT ATGAAATGAG 5 50 

GGTATTCGGT CAGATGAACG 6 00 

TATCAGGTTT AACAATGGCA 650 

GTATTATTAT TCATTGATAA 700 

AGTTTCAGCA TTACTTGGAC 750 

CACTTGCTAC TGAAATGGGT 800 

AAAGGTTCTG TTACTTCAA 84 9 



2) INFORMATION FOR SEQ ID NO: 369 



15 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
2 0 <D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

2 5 (A) ORGANISM: Staphylococcus cohnii 

(B) STRAIN: DSM 20260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369 

3 0 GAAGTTCCAG AAATTAATAA TGCCTTAGTT CTCGATATAG AAAGAGAAGA 5 0 

AGGTACTGTT GAATTAACGT TAGAAGTTGC ATTACAACTT GGTGATGACG 10 0 

TAGTACGAAC AATCGCAATG GATTCAACTG ATGGTGTTAA ACGTGGTACA 150 

GAAGTTAGAG ATAGCGGAAA TAGTATCAGC GTACCAGTTG GTAATGAAAC 2 00 

ATTAGGTAGA GTATTTAATG TATTAGGTGA TACGATAGAT TTAGATGAAG 25 0 

3 5 ACATAGATGA CTCAGTGCGT CGTGACCCAA TTCATAGAGA AGCACCTGCA 300 

TTTGATCAGT TATCTACTAA AGTTGAAATT TTAGAAACAG GTATCAAAGT 350 

CATTGATTTA TTAGCAC CAT ATATCAAAGG TGGTAAAGTT GGATTATTCG 40 0 

GTGGTGCCGG TGTTGGTAAA ACTGTATTAA TTCAAGAATT AATCAATAAT 450 

ATCGCTCAAG AGCATGGTGG TATATCCGTA TTTGC TGGTG TAGGTGAGCG 50 0 

4 0 TACGCGTGAA GGTAATGACC TATACTTTGA AATGAGTGAT AGTGGTGTTA 550 

TTAAAAAGAC AGCTATGGTA TTTGGACAAA TGAACGAACC ACCTGGTGCG 600 

CGTATGCGAG TAGCACTTTC TGGTTTAACA ATGGCTGAAT ATTTCCGGGA 650 

TGAACAAGGA CAAGATGTTC TATTATTCAT AGATAACATC TTTAGATTTA 700 

CTCAAGCTGG TTCAGAAGTT TCTGCGTTAT TAGGTCGTAT GCCTTCAGCT 750 

4 5 GTTGGTTACC AACCAACGTT AGCAACTGAA ATGGGACAAT TACAAGAACG 800 

TATTACTTCT ACAACTAAAG GTTCAGTAAC 83 0 



50 2) INFORMATION FOR SEQ ID NO: 370 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 787 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Staphylococcus epidermidis 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370 



5 AATAATGCAT 
AACATTAGAA 
CAATGGACTC 
GGTAGAGACA 
TAACGTACTA 

10 TACGACGTGA 
ACAAAAGTAG 
ACCTTACATA 
GTAAAACCGT 
GGTGGTATCT 

15 TGATCTTTAC 
TGGTCTTTGG 
TTATCCGGAT 
TGTGTTATTA 
AAGTTTCTGC 

2 0 ACACTTGCTA 



TACACATCGA 
GTTGCACTTC 
AACTGACGGC 
TAAGTGTACC 
GGAGAAACTA 
CCCTATCCAT 
AAATCTTAGA 
AAAGGTGGTA 
ACTAATC CAA 
CAGTATT CGC 
TATGAAATGA 
TCAAATGAAT 
TAACAATGGC 
TTCATTGATA 
GTTATTAGGT 
CAGAAATGGG 



AGTTCCTAAA 
AACTAGGTGA 
GTTCAAAGAG 
TGTCGGTGAC 
TTGACTTAGA 
AGACAAGCTC 
AACTGGTATT 
AAATTGGATT 
GAACTTATTA 
TGGTGTTGGT 
GTGACAGTGG 
GAGCCACCTG 
CGAATATTTC 
ACATTTTCAG 
CGTATGCCAT 
TCAATTACAA 



GAAGATGGAG 
CGATGTAGTT 
GAATGGAAGT 
GTAACTC TAG 
TGAAAAAATT 
CAGGATT CGA 
AAAGTAGTAG 
ATTTGGTGGT 
ATAACATCGC 
GAACGTACAC 
TGTTATCAAG 
GTGCACGTAT 
CGAGATGAAG 
ATTCACTCAA 
CAGCTGTTGG 
GAACGTA 



CGCTTCAATT 50 

CGTACAATTG 100 

TAAAGATACA 150 

GAAGAGTGTT 2 00 

GATGATTCAG 250 

CGAATTATCA 3 00 

ACTTATTAGC 3 50 

GCCGGTGTAG 4 00 

TCAAGAACAC 4 50 

GTGAAGGTAA 500 

AAAACTGCAA 550 

GCGTGTAGCA 60 0 

AAGGCCAAGA 650 

GCTGGTTCAG 7 00 

TTATCAAC CT 750 
787 



2) INFORMATION FOR SEQ ID NO: 371 



25 



( i ) SEQUENCE CHARACTER I STI CS : 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: ATCC 29970 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371 

40 GAAGTAC CTG AAATTAATAA CGCCTTAATC ATCGAAGTTC CCAAAGAAGA 50 

TGGTACTTTT GAATTAACGC TTGAAGTTGC ATTACAACTA GGTGATGACG 10 0 

TTGTTCGTAC AATTGCTATG GATTCAACAG ATGGTGTTCA ACGTGGTATG 15 0 

GAAGTTCAGA ACACTGGAAA AGACATTTCA GTACCAGTTG GCGAAGTAAC 2 00 

TTTAGGACGT GTATTTAACG TATTAGGTGA CACAATTGAT TTAGAAGATA 250 

45 AATTAGATGG TTCAGTAAGA CGTGATCCAA TTCATAGACA ATCACCTAAC 3 00 

TTTGACGAAT TATCTACTGA AGTAGAAATT CTTGAAACTG GAATCAAAGT 3 50 

TGTAGACTTA TTAGCACCAT ACATCAAAGG TGGTAAAATC GGTCTATTTG 400 

GTGGTGCCGG TGTTGGTAAA ACCGTTTTAA TCCAAGAATT GATTAATAAT 45 0 

ATCGCACAAG AACATGGTGG TATCTCAGTA TTTGCTGGTG TAGGTGAACG 500 

50 TACACGTGAA GGTAACGACC TATATTATGA AATGAGAGAT AGTGGTGTTA 550 

TTAAGAAAAC AGCAATGGTA TTTGGTCAAA TGAACGAGCC ACCTGGTGCA 600 

CGTATGCGTG TGGCACTTTC TGCATTGACA ATGG CTGAGT ATTTCCGTGA 650 

TGAACAAGGA CAAGACGTTC TGTTATTCAT CGATAACATT TTCAGATTTA 70 0 

CTCAAGCAGG TTCAGAAGTA TCAGCATTAT TGGGACGTAT GCCTTCAGCT 750 

55 GTAGGTTATC AACCTACTTT AGCTACAGAA ATGGGTCAAT TACAAGAACG . 800 

TATTACATCA ACGAATAAAG GTTCAGTAAC 83 0 



60 2) INFORMATION FOR SEQ ID NO: 372 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis subsp. hominls 

(B) STRAIN: ATCC 27844 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372 



TCGAACATAA TGAAGTCCCT GAAATTAATA ATGCCCTAAT TATTGAAGTA 5 0 

CCCAAAAATG ATGG CACATT TAAATTAACA TTAGAAGTTG CATTGCAACT 10 0 

AGGTGATGAT GTTGTTCGTA CTATTGCAAT GGATTCAACT GATGGTGTTC 15 0 

AACGTGGTAT GCAAGTTGTG AATACTGGTA AAGATATTAG TGTTCCTGTA 2 00 

2 0 GGTGAAGAAA C AC TTGGACG TGTGTTTAAC GTTTTAGGAG AAACAATAGA 2 50 

TTTAAACGAA AAAATAGATA GTTCTGTTAG ACGTGATCCA ATTCATCGTC 3 00 

GTCAACCTAA TTTTGATGAA TTATCTACTG AAGTAGAAAT TCTTGAAACA 350 

GGTATTAAAG TTGTAGACTT ATTAGCACCT TATATTAAAG GTGGTAAGAT 4 00 

TGGTTTATTC GGTGGTGCCG GCGTAGGTAA AACTGTATTA ATTCAAGAAT 4 50 

25 TAATCAATAA TATCGCTCAA GAACATGGTG GTATTTCTGT ATTCGCTGGT 5 00 

GTAGGTGAAC GTACTCGTGA AGGTAACGAT TTATACTATG AAATGAGCGA 550 

TAGTGGCGTT ATCAATAAAA CAGC CATGGT ATTTGGGCAA ATGAATGAGC 600 

CGCCAGGTGC GCGTATGCGT GTTGCTTTAT CAGCATTGAC AATGGCTGAA 65 0 

TATTTCCGTG ATGAACAAGG TCAAGATGTA CTTTTATTCA TTGACAATAT 700 

3 0 TTTCCGCTTT ACTCAAGCTG GTTCTGAAGT TTCAGCATTA TTAGGACGTA 75 0 

TGCCTTCAGC TGTAGGTTAT CAACCTACAT TAG CAACTGA AATGGGTCAA 80 0 

TTACAAGAAC GTATTACATC TACTAATAAA GGTTCAGTCA CTTCAA 84 6 



35 
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2) INFORMATION FOR SEQ ID NO: 373 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 84 6 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

(B) STRAIN: CSG 175 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373 

TCGAACATAA TGAAGTCCCT GAAATTAATA ATGCCCTAAT TATTGAAGTA 50 

CCCAAAAATG ATGGCACATT CAAATTAACA TTAGAAGTTG CATTGCAACT 100 

AGGTGATGAT GTTGTTCGTA CTATTGCAAT GGATTCAACT GATGGTGTTC 150 

55 AACGTGGTAT GCAAGTTGTG AATACTGGTA AAGATATTAG TGTTCCTGTA 2 00 

GGTGAAGAAA CACTTGGACG TGTGTTTAAC GTTTTAGGAG AAACAATAGA 2 50 

TTTAAACGAA AAAATAGATA GTTCTGTTAG ACGTGATCCA ATTCATCGTC 3 00 

GTCAACCTAA TTTTGATGAA TTATCTACTG AAGTAGAAAT TCTTGAAACA 350 

GGTATTAAAG TTGTAGACTT ATTAGCACCT TATATTAAAG GTGGTAAGAT 4 00 

60 TGGTTTATTC GGTGGTGCCG GCGTAGGTAA AACTGTATTA ATTCAAGAAT 450 
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TAATCAATAA TATCGCTCAA GAACATGGTG GTATTTCTGT ATTCGCTGGT 500 

GTAGGTGAAC GTACTCGTGA AGGTAACGAT TTATACTATG AAATGAGCGA 550 

TAGTGGCGTT ATCAATAAAA CAGCCATGGT ATTTGGGCAA ATGAATGAGC 600 

CGCCAGGTGC GCGTATGCGT GTTGCTTTAT CAGCATTGAC AATGGCTGAA 650 

TATTTCCGTG ATGAACAAGG T C AAGATGT A CTTTTATTCA TTGACAATAT 70 0 

TTTCCGCTTT ACTCAAGCTG GTTCTGAAGT TTCAGCATTA TTAGGACGTA 750 

TGCCTTCAGC TGTAGGTTAT CAACCTACAT TAGCAACTGA AATGGGTCAA 800 

TTACAAGAAC GTATTACATC TAC TAATAAA GGTTCAGTCA CTTCAA 84 6 



2) INFORMATION FOR SEQ ID NO: 374 



( i ) SEQUENCE CHARACTERI STICS : 
15 (A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

2 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus lugdunensis 

(B) STRAIN: ATCC 43809 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374 



ATAATGAAGT GCCTGAAATA AATAATGCGC TCATTGTTGA AATTCCTAAA 5 0 

AGTGATACAA CAATCAGTTT AACACTTGAA GTTGCTTTGC AATTAGGTGA 100 

3 0 CGATGTTGTA CGTACTATTG CAATGGATTC AACTGATGGC GTTCAACGTG 150 

GTATGGAAGT TCAAAACACA GGTAAAGACA TCAGTGTACC TGTTGGAGAT 2 00 

GAAACATTAG GAAGAGTATT TAACGTTTTA GGAGAATCTA TTGATTTAGA 250 

AGAAAAGCTA GATGACTCTG TGCGTAGAGA TCCAATTCAT AGACTAGCAC 3 00 

CTAAATTTGA TGAATTATCT ACAGAAGTAG AAATTCTTGA AACTGGTATT 3 50 

3 5 AAAGTTGTTG ATTTATTAGC ACCATATATT AAAGGTGGTA AAGTTGGATT 4 00 

GTTTGGTGGT GCCGGAGTAG GTAAAACGGT ATTAATTCAA GAATTAATCA 450 

ACAATATTGC TCAAGAACAT GGTGGTATTT CTGTGTTTGC CGGAGTAGGT 50 0 

GAACGTACAC GTGAAGGTAA TGAC TTATAT TATGAAATGA GCGATAGTGG 550 

CGTAATTAAG AAAACAG CGA TGGTATTTGG CCAAATGAAT GAACCACCTG 60 0 

4 0 GTGCACGTAT GAGAGTTGCG TTATCTGCCT TAACAATGGC TGAATATTTC 65 0 

CGTGACGAGC AAGGACAAGA CGTATTGCTG TTTAT CGATA ATATATTCCG 700 

TTTTACACAA GCAGGTTCAG AAGTATCTGC ATTACTTGGA CGTATGCCAT 75 0 

CTGCCGTTGG TTATCAACCA ACATTGGCTA CAGAAATGGG ACAATTGCAA 800 

GAAAGAATTA CATCTACAAA TAAAGGTTCT GTAAC 83 5 

45 

2) INFORMATION FOR SEQ ID NO: 3 75 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 842 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 
60 (B) STRAIN: ATCC 15305 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375 

GAGCACAATG AAGTTCCAGA AATTAACAAT GCCTTAGTCG TAGACGTTGA 5 0 

5 AAGAGATGAA GGTACAGTAT CTCTTACATT AGAAGTGGCA TTACAACTTG 100 

GCGATGATGT CGTACGTACA ATTGCAATGG ATTCTACTGA TGGTGTTAAA 150 

CGTGGTACAG AAGTTCGAGA TAGCGGAGAT AGCATCAGTG TTC CAGTTGG 2 00 

TGATGCTACG TTAGGACGTG TGTTTAATGT TCTTGGTGAT ACAATTGACT 250 

TAGACGAGAA GC TTGATACT TCTGTCAAAC GTGATCCAAT TCATAGAGAA 3 00 

10 GCACCTGCAT TCGATCAATT ATCAACAAAA GTTGAAATCT TAGAAACAGG 3 50 

TATTAAAGTA ATTGATTTAC TTGCACCATA TATTAAAGGT GGTAAAATCG 4 00 

GTTTATTCGG TGGCGCTGGT GTAGGTAAAA CAGTATTAAT TCAAGAATTA 4 50 

ATTAATAATA TAGCTCAAGA ACATGGTGGT ATTTCAGTAT TTGCCGGCGT 500 

AGGTGAACGT ACGCGTGAAG GTAATGACTT ATACTACGAA ATGAGTGATA 55 0 

15 GTGGTGTTAT TAAGAAAACA GCTATGGTCT TCGGACAAAT GAATGAGCCA 60 0 

C CTGGTGCGC GTATGCGTGT TGCTTTATCA GGCTTAACAA TGGCTGAACA 65 0 

CTTCCGTGAT GTACAAGGAC AAGATGTTTT ACTATTTATT GATAACATAT 700 

TCAGATTTAC GCAAGCTGGT T CAGAAGT AT CAGCACTATT AGGTCGTATG 750 

CCATCAGCCG TTGGTTATCA ACCTACCCTT GCTACTGAAA TGGGTCAATT 80 0 

2 0 ACAAGAACGT ATTACATCAA CAACTAAAGG ATCTGTAACG TC 842 



25 



2) INFORMATION FOR SEQ ID NO: 3 76 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 842 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Staphylococcus simulans 

(B) STRAIN: ATCC 27848 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376 

4 0 TGATGAACTG CCTAAGATTA ATAACGCATT AGTGCTAGAT GTACCTAAGA 50 

AAGATGGCAC GACTGAATCT CTTACATTAG AAGTAGCACT TGAATTAGGC 10 0 

GACGACGTAG TTAGAACTAT CGCCATGGAC TCTACAGACG GAATTAAACG 150 

TGGTGACGAC GTTAAAGACA CTGGTCGTCC AATCAGTGTA CCTGTCGGTG 2 00 

AAGATACGTT AGGAAGAGTA TTTAACGTTT TAGGTGATCC AAT CGATAAT 250 

45 GATGGACCGA TTTCTGAATC AGTTCCACGT GAACCAATTC ATAGACAACC 300 

ACCTAAATTT GATGAATTAT CAACAAAAGT TGAACTACTT GAAACTGGTA 35 0 

TCAAAGTAGT AGACTTATTA GCACCATATA TCAAAGGTGG TAAAGTTGGT 40 0 

TTATTCGGTG GTGCCGGAGT AGGTAAAACT GTATTAATCC AAGAATTAAT 450 

TAATAACATC GCTCAAGAAC ACGGCGGTAT TTCAGTATTC GCAGGTGTTG 500 

5 0 GTGAACGTAC ACGTGAAGGT AACGACTTGT ACTTCGAAAT GAGCGACAGT 550 

GGTGTTATCA AGAAAACAGC GATGGTATTC GGACAAATGA ACGAACCACC 600 

TGGTGCACGT ATGCGTGTAG CTTTATCAGG TTTAACAATG GCTGAATACT 650 

TCCGTGATGT TAAAGGACAA GACGTTCTTT TATTCATCGA TAACATTTTC 700 

CGCTTCACAC AAGCAGGTTC TGAGGTATCA GCATTGCTTG GCCGTATGCC 750 

55 ATC AGCCGTT GGTTACCAAC CAACATTGGC AACAGAAATG GGTCAATTAC 800 

AAGAACGTAT CACTTCTACA ATGAAAGGTT CTATCACATC TA 842 



60 2) INFORMATION FOR SEQ ID NO: 377 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: ATCC 27836 



15 



35 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377 



CATAACGAAG TCCCTGATAT TAATAATGCC CTTATTATTG AAGTTCCAAA 50 

AGAAGATGGA ACGTTAAACT TAACATTAGA AGTTGCACTA CAATTAGGTG 100 

ATGATGTTGT ACGTACAATT GCAATGGATT CAACTGATGG TGTTCAAAGA 150 

GGCATGGATG TTAAAGACAC AGGTAAAGAT ATTAGTGTAC CTGTAGGCGA 2 00 

20 TGAAACGCTT GGAAGAGTGT TTAATGTACT AGGTGAAACA ATTGACTTGG 2 50 

AAGAGAAAAT TGATGATTCC GTACGTCGTG ATCCAATCCA TAGACAATCA 3 00 

CCAGGTTTCG ATGAATTATC TAC TGAAGTA GAAATCTTAG AAACAGGTAT 350 

TAAAGTAGTA GACTTATTAG CACCTTACAT TAAAGGTGGT AAAGTTGGAC 4 00 

TATTCGGTGG TGC CGGAGTA GGTAAAACCG TTTTAATCCA AGAATTAATT 4 50 

2 5 AACAATATTG CACAAGAACA TGGTGGTATT TCAGTATTCG CGGGTGTAGG 500 

TGAACGTACT CGTGAAGGTA ATGATTTATA CTATGAAATG AGTGATAGTG 550 

GTGTAATTAA GAAAACAGCG ATGGTATTTG GACAAATGAA TGAACCACCT 600 

GGCGCACGTA TGCGTGTAGC TTTATCTGGT TTAACTATGG CTGAATACTT 650 

CCGTGATGAA CAAGGACAAG ACGTACTTTT ATTCATCGAT AATATTTTCA 7 00 

30 GATTTACACA AGCTGGTTCT GAAGTTTCTG CATTACTTGG TCGTATGCCT 750 

TCAGCCGTTG GTTACCAACC AACATTAGCA ACTGAAATGG GTCAATTACA 800 

AGAACGAATT ACAT CTACAA ATAAAGGTTC TGTAACATCT A 841 



2) INFORMATION FOR SEQ ID NO: 378 



( i ) SEQUENCE CHARACTER I STI CS : 
(A) LENGTH: 84 6 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus acidominimus 

(B) STRAIN: ATCC 51726 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378 

TTTAACACGA ATGAACCGCT TCCTGAGATA AATAATGCAC TTGTTGTTTA 50 

CAAAGACAGT GAGAAAAAAC ATAAAATCGT TCTTGAAGTA GCTCTTGAAC 100 

TTGGTGAAGG CCTCGTTCGT ACCATTGCTA TGGAATCAAC TGATGGTTTG 150 

55 ACACGTGGTC TAGAAGTTCT TGATACAGGC CGTGCAATCA GTGTACCAGT 200 

TGGTAAAGAA ACGCTTGGAC GTGTCTTCAA CGTTCTTGGT GATGCTATCG 250 

ATCTTGAAGA AC CATTTGGA GAAGATGCAG AACGTCACCC CATT CATAAG 3 00 

AGTGCCCCAA CTTTTGATGA ATTATCAACG TCAACAGAAA TCCTTGAAAC 3 50 

AGGGATTAAA GTTATCGACC TACTTGCCCC TTACTTAAAA GGAGGGAAGG 4 00 

60 TTGGACTTTT CGGTGGTGCC GGAGTTGGTA AGACCGTTCT TATCCAAGAG 4 50 
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TTGATTCATA ACATTGCTCA AGAGCATGGT GGTATTTCAG TATTTACCGG 500 

AGTTGGTGAA CGTACACGTG AAGGTAATGA CCTCTATTGG GAAATGAAAG 550 

AATCAGGCGT TATTGAAAAA ACAGCTATGG TATTTGGTCA GATGAATGAG 600 

CCACCTGGTG CACGTATGCG TGTAGCCCTT ACTGGTTTGA CAATCGCTGA 65 0 

ATATTT CCGT GATGTTGAAG GACAGGACGT GCTTCTCTTT ATTGATAACA 70 0 

TTTTTCGTTT C AC ACAAG CA GGTTCTGAAG TTTCAGCTCT TCTTGGACGT 750 

ATGCCATCAG CCGTTGGTTA TCAACCAACC TTGGCAACTG AAATGGGTCA 80 0 

ATTGCAAGAA CGTATCACGT CAACTAAAAA AGGTTCTGTT ACATCA 84 6 



2 ) INFORMATION FOR SEQ ID NO: 379 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

2 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 12403 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379 

TTGCAAGTGG CGACAAACTT C CTGAGATTA ATAATGCATT GATTGTCTAT 50 
AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTA CTCTTGAACT 10 0 

3 0 TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGTTT GGAAGTATTA GATACTGGTC GCGCAATTAG TGTGCCGGTT 2 00 

GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 250 

CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 3 00 

AAG C AC CATC GTTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACA 350 

3 5 GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 400 

TGGACTTTTC GGTGGTGCAG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 45 0 

T AATC CACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 500 

GTAGGAGAAC GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 550 

ATCTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 600 

40 CAC CAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 650 

TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 7 00 

CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 750 

TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 800 

TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CCTCAA 846 

45 

2) INFORMATION FOR SEQ ID NO: 3 80 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TO POLOGY : Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 
60 (B) STRAIN: ATCC 13813 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380 

TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50 

5 AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTG CTCTTGAACT 10 0 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 15 0 

CACGTGGTTT GGAAGTATTA GATACTGGTC GCGCAATTAG TGTGCCGGTT 2 00 

GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 2 50 

TCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAAC CA AT CCATAAAA 3 00 

10 AAGCAC CATC GTTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACT 3 50 

GGT AT TAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 4 00 

TGGACTTTTC GGTGGTGCGG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 4 50 

TAATC CACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 500 

GTAGGAGAAC GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 550 

15 ATCTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 600 

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 650 

TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 700 

CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 750 

TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 80 0 

2 0 TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CCTCAA 84 6 



25 



30 



35 



2) INFORMATION FOR SEQ ID NO: 381 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 84 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 12 973 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381 



4 0 TTGCAAGTGG 
AAAAATGGCG 
TGGTGACGGC 
CACGTGGTTT 
GGTAAGGATA 

4 5 CCTTGAAGAG 
AAGCACCATC 
GGTATTAAAG 
TGGACTTTTC 
TAATC CACAA 

50 GTAGGAGAAC 
ATCTGGCGTT 
CACCAGGAGC 
TACTTCCGTG 
CTTCCGTTTC 

55 TGCCTTCAGC 
TTACAAGAGC 



CGACAAACTT 
ATAAGTCACA 
CTCGTTCGTA 
GGAAGTATTA 
CTTTGGGTCG 
CCTTTTGCAG 
ATTTGATGAA 
TTATTGACTT 
GGTGGTGCGG 
CATCGCCCAA 
GTACTCGTGA 
ATTGAAAAAA 
ACGTATGCGT 
ATGTAGAAGG 
ACACAAGCTG 
CGTTGGTTAT 
GTATCACTTC 



CCTGAGATTA 
AAAAGTAGTA 
CAATCGCTAT 
GATACTGGTC 
TGTCTTCAAC 
AAGATGCAGA 
TTATCAACAT 
ATTAGCACCT 
GTGTTGGTAA 
GAACATGGTG 
AGGGAATGAC 
CGGCTATGGT 
GTGGCACTTA 
ACAAGATGTG 
GGTCAGAAGT 
CAACCAACAC 
AACTAAAAAA 



ATAATGCATT 
CTTGAAGTTG 
GGAATCAACT 
GTGCAATTAG 
GTTCTTGGAG 
ACGTCAACCA 
CATCAGAAAT 
TACTTAAAAG 
AACCGTTCTT 
GTATTTCAGT 
CTTTATTGGG 
CTTTGGTCAA 
CTGGTCTTAC 
CTTCTCTTCA 
GTCAGCGCTT 
TTGCTACAGA 
GGTTCTGTTA 



GATTGTCTAT 5 0 

CTCTTGAACT 10 0 

GATGGGCTTA 15 0 

TGTGCCGGTT 200 

ATGCTATTGA 250 

ATCCATAAAA 3 0 0 

CTTAGAAACA 3 5 0 

GTGGTAAAGT 4 0 0 

ATTCAAGAAT 4 5 0 

ATTTACTGGT 500 

AAATGAAAGA 550 

ATGAATGAAC 6 00 

AATAGCTGAG 650 

TTGATAATAT 7 0 0 

TTAGGTCGTA 750 

AATGGGACAA 80 0 

CCTCA 84 5 



60 2) INFORMATION FOR SEQ ID NO: 382 
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{ i ) SEQUENCE CHARACTER I STI CS : 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 27591 



15 



35 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382 



TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50 

AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTG CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGTTT GGAAGTATTA GATACTGGTC GCGCAATTAG TGTGCCGGTT 2 00 

2 0 GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 2 50 

CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 300 

AAGCACCATC GTTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACT 3 50 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 400 

TGGACTTTTC GGTGGTGCAG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 4 50 

25 TAATCCACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 500 

GTAGGAGAAC GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 550 

ATCTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 6 00 

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 650 

TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 70 0 

3 0 CTTCCGTTTC ACACAAGCTG GGT CAGAAGT GTCAGCGCTT TTAGGTCGTA 750 

TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 8 00 

TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CATCA 845 



2) INFORMATION FOR SEQ ID NO: 383 



( i ) SEQUENCE CHARACTER I STICS : 
(A) LENGTH: 845 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



( ii ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: CDC SS1073 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383 

TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50 

AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTG CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

55 CACGTGGTTT GGAAGTATTA GATACTGGTC GCGCAATTAG TGTGCCGGTT 200 

GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 250 

CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 3 00 

AAGCACCATC GTTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACT 3 50 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 400 

6 0 TGGACTTTTC GGTGGTGCAG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 450 
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TAATCCACAA CATCGCCCAA GAACATGGTG 
GTAGGAGAAC GTACTCGTGA AGGGAATGAC 
ATCTGGCGTT ATTGAAAAAA CGGCTATGGT 
CACCAGGAGC ACGTATGCGT GTGGCACTTA 
5 TACTTCCGTG ATGTAGAAGG ACAAGATGTG 
CTTCCGTTTC ACACAAGCTG GGTCAGAAGT 
TGCCTTCAGC CGTTGGTTAT CAACCAACAC 
TTACAAGAGC GTATCACTTC AACTAAAAAA 



GTATTTCAGT ATTTACTGGT 500 

CTTTATTGGG AAATGAAAGA 550 

CTTTGGTCAA ATGAATGAAC 600 

CTGGTCTTAC AATAGCTGAG 650 

CTTCTCTTCA TTGATAATAT 700 

GTCAGCGCTT TTAGGTCGTA 75 0 

TTGCTACAGA AATGGGACAA 800 

GGTTCTGTTA CATCA 84 5 



10 

2) INFORMATION FOR SEQ ID NO: 3 84 

( i ) SEQUENCE CHARACTER I STI CS : 
15 (A) LENGTH: 84 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

2 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus dysgalactlae 

(B) STRAIN: ATCC 43078 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384 



TTGCTAGTGG GGACAAACTT C C AGAGATT A ATAATGCATT GATTGTTTAT 50 

AAAGATAGTG ATAAAAAGCA AAAAATCGTC CTTGAAGTTG CTCTGGAACT 10 0 

3 0 TGGTGACGGT ATGGTGCGAA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGGTT AGAAGTTCTT GACACTGGTC GTGCGATTAG TGTAC CAGTA 2 00 

GGTAAAGAAA CTTTGGGACG CGTCTTTAAT GTACTTGGAG AAACCATTGA 250 

CTTGGAAGAA CCATTTGCAG AAGACGTTGA CCGTCAGCCA ATCCATAAAA 3 00 

AAGCACCATC GTTTGATGAA TTATCAACAT CAT CAGAAAT TCTTGAAACT 3 50 

3 5 GGTATCAAGG TAATTGACCT TCTTGCCCCT T AC CTTAAAG GTGGTAAAGT 4 00 

TGGACTTTTC GGGGGTGCCG GAGTTGGTAA GACTGTCCTT AT C CAAGAAT 450 

TAATCCACAA TATCGCCCAA GAACACGGAG GTATTTCAGT ATTTACCGGT 50 0 

GTTGGTGAGC GAACACGTGA AGGAAATGAC CTTTACTGGG AAATGAAAGA 55 0 

ATCAGGCGTT ATTGAGAAAA CTGCCATGGT TTTTGGTCAG ATGAATGAGC 60 0 

40 CGC CTGGGGC ACGTATGCGT GTAGCCCTTA CTGGTTTAAC CATTGCTGAG 650 

TATTTCCGTG ATGTAGAAGG CCAAGATGTT TTGCTCTTTA TTGATAATAT 700 

CTTCCGTTTC ACTCAGGCAG GTTCAGAAGT ATCAGCCCTC TTAGGCCGTA 750 

TGCCTTCTGC TGTTGGTTAC CAACCGACCC TTGCTACTGA AATGGGACAA 800 

TTGCAAGAAC GTATTACGTC AACTCAAAAA GGATC TGTTA CTTCT 84 5 

45 



2) INFORMATION FOR SEQ ID NO: 385 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



( vi ) ORIGINAL SOURCE : 

(A) ORGANISM: Streptococcus egui subsp. egui 
6 0 (B) STRAIN: ATCC 952 8 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385 

TTGCGAGTGG GGACAAACTA CCAGAGATTA ATAATG CGTT GATAGTTTAT 50 

5 AAAGATGGCG ATAAAAAGCA AAAAATCGTT CTCGAGGTTG CCCTAGAGCT 10 0 

TGGAGACGGT ATGGTACGTA CAATTGCTAT GGAATCAACC GATGGGCTTA 15 0 

CACGTGGATT AGAGGTTCTT GATACTGGTC GTGCCATTAG TGTACCAGTT 2 00 

GGTAAAGAGA CTCTAGGTCG TGTTTTCAAC GTTCTTGGTG AAACCATCGA 250 

CCTAGAAGCA CCATTTGCAG ATGATGTTAA TCGTGAACCG ATCCATAAAA 3 00 

10 AAGCACCAGC CTTTGATGAA TTGTCAACAT CAT C AG AAAT TCTTGAAACA 350 

GGTATCAAGG TTATTGACCT GCTTGCCCCT TACTTAAAGG GTGGTAAGGT 4 00 

CGGTCTTTTC GGTGGTGCCG GAGTTGGTAA AACCGTTCTT ATCCAAGAAT 450 

TAATCCACAA TATCGCTCAA GAGCATGGTG GGATCTCGGT ATTTACCGGT 50 0 

GTTGGTGAGC GTACGCGTGA AGGAAATGAC CTTTACTGGG AAATGAAGGA 550 

15 ATCAGGCGTT ATTGAAAAAA CAGCCATGGT TTTTGGTCAG ATGAATGAAC 600 

CACCAGGAGC CCGTATGCGT GTTGCCTTGA CCGGCTTGAC AATTGCTGAA 65 0 

TATTT CCGCG ATGTTGAAGG CCAAGACGTC CTGCTCTTCA TTGACAATAT 70 0 

TTTCCGCTTT ACTCAAGCAG GCTCAGAGGT ATCAGCCCTT CTAGGTCGTA 750 

TGCCTTCAGC CGTTGGTTAC CAGCCAACAC TTGCCACTGA AATGGGACAA 800 

2 0 TTGCAAGAGC GTATCACCTC AACGAAAAAA GGCTCTGTTA CCTCTA 84 6 



25 



2) INFORMATION FOR SEQ ID NO: 3 86 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 
35 (A) ORGANISM: Streptococcus anginosus 

(B) STRAIN: ATCC 2 7335 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386 

4 0 TTG CAGCTGG TGATAAATTA CCTGAGATAA ATAATGCACT TGTAGTCTAT 5 0 

AAAAATGACG AAAATAAATC AAAAATCGTC CTTGAAGTAG CTCTTGAGCT 100 

TGGTGATGGA GTGGTTCGAA CTATTGCCAT GGAATCCACT GATGGGTTGA 150 

CTCGTGGCAT GGAAGTTCTA GATACTGGTC GACCAATTTC TGTTCCAGTT 2 00 

GGGAAAGAAA CACTTGGTCG CGTCTTTAAC GTTTTAGGCG ATAC CATTGA 250 

4 5 TTTGGATACT CCATTCGGCG AAGATGCAGA ACGTCAACCA ATCCATAAAA 3 00 

AAGCTCCAAC TTTTGATGAG TTATCTACTT CATCAGAAAT CTTAGAAACA 3 50 

GGAATAAAGG TTATTGACCT TTTAGCCCCC TACCTCAAAG GTGGGAAAGT 4 00 

CGGCCTCTTC GGTGGTGCTG GCGTTGGGAA AACTGTCTTG ATTCAAGAGT 450 

TGATTCATAA TATCGCCCAA GAACACGGCG GGATTTCAGT CTTTACTGGT 5 00 

50 GTTGGGGAAC GAACTCGTGA AGGGAATGAC CTGTACTGGG AAATGAAAGA 55 0 
ATCTGGTGTT ATCGAAAAGA CGGCTATGGT CTTTGGGCAA ATGAATGAAC 
CGCCTGGAGC ACGTATGCGT GTAGCTTTGA CTGGGTTAAC GATTGCAGAG 
TATTT CCGTG ATGTGGAAGG TCAAGATGTT CTTTTGTTTA TTGATAATAT 
TTTCCGTTTC ACTCAAGCTG GTTCTGAAGT GTCAGCCCTT CTTGGTCGTA 

55 ™?5* TCAGC TGTTGGT TAC CAACCAACCT TGGCTACTGA AATGGGGCAA buu 

TTACAAGAAC GTATTACAT C AACGAAAAAA GGTTCTGTTA CCTCAA 846 



600 
650 
700 
750 
800 



60 2 ) INFORMATION FOR SEQ ID NO: 387 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 84 3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus salivarius 

(B) STRAIN: ATCC 7073 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387 

GCAGCTGGTG ATAAACTTCC TGAGATTAAC AATGCATTGG TCGTTTATAC 

TGATGAACAA AAGTCTAAAC GTATCGTGCT CGAAGTAGCT CTTGAACTTG 

GAGAAGGTGT GGTTCGTACC ATTGCCATGG AATCTACTGA TGGATTGACT 
CGTGGACTAG AAGTTCTGGA CACTGGTCGT CCAATCAGCG TTCCTGTTGG 

TAAAGATACC CTTGGACGTG TCTTTAACGT TCTTGGTGAT ACCATTGACT 

TGGAAGCACC TTTTGCAGAC GATGCAGAGC GTGAACCAAT TCACAAAAAA 

GCACCAACTT TCGATGAATT GTCAACATCT ACTGAAATCC TTGAAACAGG 

GATTAAAGTT ATCGACTTGC TAGCCCCTTA CCTTAAGGGT GGTAAAGTCG 

GACTCTTCGG TGGTGC CGGT GTTGGTAAAA CCGTTCTTAT TCAAGAGTTG 

ATTCACAACA TTGCCCAAGA GCACGGTGGT ATTTCCGTGT TTACAGGTGT 

TGGTGAACGT ACACGTGAAG GTAATGACCT TTACTGGGAA ATGAAAGAAT 

CTGGCGTTAT CGAGAAAACA GCCATGGTCT TCGGTCAAAT GAACGAACCA 

CCTGGAGCAC GTATGCGTGT TGCCCTTACT GGTTTGACAA TTGCGGAATA 

CTTCCGTGAT GTCGAGGGTC AAGACGTTCT TCTCTTCATC GATAACATCT 

TCCGTTTCAC TCAAGCAGGT TCTGAGGTTT CTGCCCTTCT TGGTCGTATG 

CCATCAGCCG TTGGTTACCA ACCTACACTT GCTACTGAAA TGGGT CAATT 

GCAAGAACGT ATCACATCAA CTAAAAAAGG TTCTGTTACA TCT 



2) INFORMATION FOR SEQ ID NO: 388 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus suis 

(B) STRAIN: ATCC 43765 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388 

TTGCAGCAGA AGATAAACTT CCTGAGATTA ACAACGCACT CGTTGTATAT 
AAAAATGATG ATTCCAAACA AAAAGTCGTG CTTGAAGTGG CTTTGGAACT 
TGGTGATGGC GTTGTACGGA CCATTGCCAT GGAATCAACG GATGGATTGA 
CACGTGGGAT GGAAGTTCTC GATACAGGTC GTCCCATCTC TGTTCCAGTC 
GGTAAAGAAA CGCTGGGTCG TGTCTTCAAT GTGTTGGGAG ATACCATTGA 
CCTTGAAGAG TCTTTTCCGG CAGATTTTGA ACGTGAGCCT ATC CATAAGA 
AAGCGCCGGC TTTTGACGAA TTATCTACTT CAAGCGAAAT TTTGGAAACA 
GGGATTAAGG TTATCGACCT CCTAGCACCT TATCTAAAAG GTGGTAAGGT 
TGGTCTCTTC GGTGGTGCTG GTGTTGGTAA AACCGTTCTT ATC CAAGAAT 
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TGATTCACAA TATTGCCCAA GAACACGGTG GTATCTCTGT ATTTAC CGGA 5 00 

GTTGGCGAGC GTACCCGTGA AGGGAACGAT CTTTACTGGG AAATGAAAGA 55 0 

ATCAGGTGTT ATTGAAAAAA CGGCCATGGT ATTTGGTCAG ATGAATGAGC 60 0 

CAC CAGGAGC CCGTATGCGT GTTGCTCTTA CTGGTTTGAC TATTGC GGAA 65 0 

TACTTCCGTG ATGTGGAAGG GCAGGATGTT CTTCTGTTCA TCGATAATAT 70 0 

CTTCCGTTTC ACACAGGCTG GTTCAGAAGT GTCTGCCCTC TTGGGTCGTA 75 0 

TGCCATCAGC CGTTGGTTAT CAGCCAACAC TTGCGACGGA GATGGGACAA 800 

TTGCAGGAGC GTATTACCTC AACCAAGAAG GGTTCTGTTA C 841 



2) INFORMATION FOR SEQ ID NO: 3 89 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 84 4 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

2 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus uberis 

(B) STRAIN: ATCC 19436 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389 



GCAAACGGTG AAAAATTACC AGAGATTAAT AATGCATTGA TAGTTTATAA 50 

AGGTAGCGAT AAAAAACAAA AGATTGTTCT TGAAGTTGCT TTGGAACTTG 10 0 

30 GGGACGGAAT GGTTCGTACA ATCGCTATGG AAT CAACTGA TGGGCTTACA 150 

CGTGGATTAG AAGTTTTAGA TACTGGCCGT GCCATTAGTG TAC CAGTCGG 2 00 

AAAAGAAACT TTGGGTCGTG TTTTCAATGT GCTTGGTGAA ACCATTGATT 25 0 

TGGATGAAC C ATTTGCCGCT GATGCTGCAA GAGAACCCAT CCATAAAAAA 3 00 

GCCCCAGCAT TTGATGAACT ATCAACGTCT TCAGAAATTC TTGAAACCGG 350 

35 AATAAAAGTT ATTGACTTAT TAGCCCCTTA TCTCAAAGGT GGTAAAGTTG 4 00 

GTTTATTTGG TGGTGCCGGA GTAGGTAAAA CGGTTTTAAT TCAAGAATTA 450 

ATTCATAATA TTGCACAAGA ACATGGTGGT ATTTCAGTAT TTACCGGTGT 50 0 

TGGTGAAAGA ACTCGTGAAG GTAATGACCT TTATTGGGAA ATGAAAGAAT 55 0 

CTGGCGTTAT TGAAAAAACA GCCATGGTAT TTGGACAAAT GAACGAAC CA 60 0 

4 0 CCAGGAGCAC GTATGCGCGT TGCTTTAACA GGTTTAACCA TTGCTGAATA 65 0 

TTTCCGGGAT GTTGAAGGTC AAGATGTTTT GCTCTTTATT GACAACATTT 70 0 

TCCGTTTCAC GCAAGCTGGT TCAGAAGTTT CAGCC CTATT GGGTCGTATG 750 

CCTTCAGCGG TAGGATACCA ACCAACACTT GCTACCGAAA TGGGACAATT 800 

GCAAGAAAGA ATTACCTCAA CTAACAAGGG ATCTGTTACT TCTA ^ 



45 



2) INFORMATION FOR SEQ ID NO: 390 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 896 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tatumella ptyseos 
6 ° (B) STRAIN: ATCC 33301 



844 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390 

TTC CCTCAGG ACGCTGTACC ACAGGTGTAC AACGCTCTTG AGGTTGAAAA 5 0 

5 TGGTGATACC CGTCTGGTGC TGGAAGTTCA GCAGCAGCTG GGCGGTGGTG 10 0 

TCGTTCGTAC GATTGCAATG GGAACCTCTG ACGGCCTGAA ACGTGGCCTT 15 0 

AAGGTGACCG ATCTGCAAAA AC CGATTCAG GTACCGGTCG GTAAAGCGAC 2 00 

GCTGGGCCGT ATCATGAACG TACTGGGTCA GCCAATCGAT ATGAAAGGCG 250 

ACCTGAAGAA CGAAGATGGT AGCAATGTTG AGGTGAACTC TATTCACCGT 3 00 

10 GCAGCGCCAA GCTACGAAGA ACTGGCTAAC TCTACTGAGC TGCTGGAAAC 3 50 

GGGTATCAAG GTTATCGACC TGATCTGTCC GTTTGCAAAA GGCGGTAAAG 4 00 

TGGGTCTGTT CGGTGGTGCG GGTGTAGGTA AGACCGTCAA CATGATGGAA 4 50 

CTGATCCGTA ACATCGCTAT CGAGCACTCT GGTTACTCTG TATTTGCAGG 500 

GGTGGGTGAG CGTACCCGTG AAGGTAACGA CTTCTACCAC GAAATGACCG 5 50 

15 AGTCTAACGT TCTGGATAAA GTTGCTCTGG TTTATGGCCA GATGAACGAG 600 

CCACCAGGAA ACCGTCTGCG CGTTGCGCTG ACCGGTCTGA CTATGGCTGA 650 

AAAATTCCGT GACGAAGGCC GTGACGTACT GCTGTTCGTT GAT AAC AT CT 7 00 

ATCGTTATAC CCTGGCCGGT ACTGAAGTTT CAGCACTGCT GGGT CGTATG 750 

CCTTCTGCGG TAGGTTATCA GCCAACACTG GCCGAAGAAA TGGGTGTTCT 8 00 

2 0 TCAGGAACGT ATCACGTCAA CCAAAACCGG TTCAATCACT TCCGTA 8 96 



2) INFORMATION FOR SEQ ID NO: 391 



25 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Trabulsiella guamensls 

(B) STRAIN: ATCC 49490 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391 

4 0 TTC CCTCAGG ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTTATGAA 50 

TGGTAGTGAG CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGTGGTGGTA 10 0 

TCGTACGTAC TATCGCCATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG 15 0 

GATGTAAAAG ATCTCGAGCA TCCGATCGAA GTCCCGGTAG GTAAAGCAAC 2 00 

GCTGGGTCGT ATCATGAACG TGCTGGGTCA GCCGATCGAT ATGAAAGGCG 2 50 

45 ACATCGGCGA AGAAGAGCGT TGGGCTATCC ACCGCGCAGC ACCGTCCTAC 3 00 

GAAGAGCTGT CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT 3 50 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 4 00 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTGAT TCGTAACATC 4 50 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTGG GTGAACGTAC 500 

50 TCGTGAGGGT AACGACTTCT AC C ACGAAAT GACCGACTCC AACGTTCTGG 55 0 

ACAAAGTATC CCTGGTGTAT GGACAGATGA ACGAGCCGCC GGGAAAC CGT 60 0 

CTGCGCGTTG CACTGACCGG TCTGACCATG GCTGAGAAGT TCCGTGACGA 650 

AGGTCGTGAC GTTCTGCTGT TCGT CGATAA CATCTACCGT TACACCCTGG 700 

CGGGTACTGA AGTAT C TGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT 75 0 

55 TACCAGCCGA CCCTGGCGGA AGAGATGGGT GTTCTTCAGG AACGTATCAC 800 

CTCAACCAAA ACCGGTTCTA TCACCTCCG 82 9 



6 0 2) INFORMATION FOR SEQ ID NO: 3 92 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia bercovieri 

(B) STRAIN: ATCC 43970 



15 



35 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392 



CGAATTCCCC CAAGACGCTG TACCAAAAGT GTACAACGCC CTTGAGGTTG 50 

AAGGCACAGC TCAGAAGCTG GTGCTGGAAG TT CAGCAAC A GCTGGGCGGT 100 

GGTGTTGTTC GTTGTATCGC AATGGGCTCT TCCGATGGTC TGAGCCGCGG 150 

GTTGAAAGTC ATCAACCTGG AACACCCAAT TGAAGTGCCG GTGGGTAAAT 2 00 

2 0 CAACTCTGGG CCGTATCATG AAC GTATTGG GTGACCCAAT CGACATGAAA 25 0 

GGTCCTATCG GTGAAGAAGA GCGTTGGGCA ATCCACCGCG AAGCGCCTTC 3 00 

TTACGAAGAG CTTGCCAGCT CGCAAGATCT GTTAGAAACC GGTATCAAGG 350 

TAATGGATCT GATTTGTCCG TTCGCTAAGG GCGGTAAAGT CGGTCTGTTC 4 00 

GGTGGTGCGG GTGTGGGTAA AACAGTCAAC ATGATGGAGC TGATTCGTAA 450 

25 TATTGCGATT GAGCACTCAG GTTATTCTGT ATTTGCCGGT GTGGGTGAGC 50 0 

GTACTCGTGA GGGTAACGAC TTCTACCACG AGATGACTGA CTCCAACGTT 550 

CTGGACAAAG TATCCTTGGT TTATGG CCAG ATGAATGAGC CAC C AGGTAA 60 0 

CCGTCTGCGC GTTGCACTGA CCGGCTTGAC CATGGCGGAG AAATTCCGTG 65 0 

ATGAAGGTCG TGATGTACTG TTATTCATCG ATAACATCTA TCGTTATACC 70 0 

3 0 CTGGCCGGTA CAGAGGTATC TGCACTGCTA GGTCGTATGC CATCAGCGGT 75 0 

AGGCTATCAG CCAACGCTGG CAGAAGAGAT GGGTGTGTTG CAGGAACGTA 80 0 

TCACTTCCAC CAAGACGGGT TCAATCACCT CCGTA 83 5 



2) INFORMATION FOR SEQ ID NO: 3 93 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 812 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia enterocolitica 

(B) STRAIN: ATCC 9610 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393 

GCTGTACCAA AAGTGTACAA CGCCCTTGAG GTTGAAGGCG CAGCTGAGAA 50 

GCTGGTGCTG GAAGTTCAGC AACAGC TGGG CGGTGGTGTT GTTCGTTGTA 100 

TCGCAATGGG CTCTTCCGAT GGTCTGAGCC GTGGGTTGAA AGTCATCAAC 150 

55 CTGGAACACC CAATTGAAGT GCCTGTGGGC AAGTCAACTC TGGGCCGTAT 200 

CATGAACGTA TTGGGTGACC CAATCGACAT GAAAGGTCCT ATCGGCGAAG 25 0 

AAGAGCGTTG GGCAATCCAT CGTGAAGCGC CTTCTTACGA AGATCTTGCC 3 00 

AGCTCGCAAG ACTTGTTAGA AACCGGTATC AAGGTAATGG ACTTGATTTG 350 

TCCGTTCGCT AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400 

60 GTAAAACGGT AAACATGATG GAGCTTATTC GTAACATTGC GATTGAGCAC 450 
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TCAGGTTATT CCGTATTTGC TGGCGTGGGT GAGCGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAGATGA CTGACTCCAA CGTTCTGGAC AAAGTATCCT 550 

TGGTTTATGG CCAAATGAAT GAGCCACCAG GTAACCGTCT GCGCGTTGCA 600 

CTGACCGGCT TGACCATGGC GGAGAAAT TC CGTGATGAAG GTCGTGACGT 650 

ATTGCTGTTC AT CGATAACA TCTATCGCTA TACCTTAGCC GGTACGGAAG 700 

TTTCCGCACT GCTGGGTCGT ATGCCATCTG CCGTAGGTTA CCAGCCAACG 750 

CTGGCAGAAG AGATGGGTGT GTTGCAGGAA CGTATTACTT CCACCAAGAC 800 

GGGTTCAATC AC 812 



2) INFORMATION FOR SEQ ID NO: 3 94 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 802 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

2 0 ( i i ) MOLECULE TYPE : Genomi c DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia frederiksenii 

(B) STRAIN: ATCC 33641 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394 



AAAGTGTACA ACGCCCTTGA GGTTGAAGGT ACTGCTGAGA AGTTAGTACT 50 

GGAAGTT CAG CAACAGCTGG GCGGTGGTGT TGCT CGTTGT ATCGCCATGG 100 

3 0 GCTCTTCCGA TGGTTTGAGC CGCGGGTTGA AAGTTGT CAA CCTGGAACAC 150 

CCAATTGAAG TACCGGTTGG TAAATCAACT CTGGGCCGTA TCATGAACGT 2 00 

ATTGGGTGAC CCAATCGACA TGAAAGGTCC TATCGGTGAA GAAGAGCGTT 250 

GGGCAATCCA CCGCGAAGCG CCTTCTTACG AAGAGCTTGC CAGCTCGCAA 3 00 

GATCTGTTAG AAAC CGGTAT CAAGGTAATG GATCTGATTT GCCCGTTCGC 3 50 

35 TAAAGGCGGT AAAGTCGGTC TGTTCGGTGG TGCGGGTGTA GGTAAAACGG 400 

TAAACATGAT GGAGCTGATC CGTAATATCG CGATCGAG CA CTCAGGTTAT 450 

TCCGTATTTG CGGGTGTTGG TGAACGTACC CGTGAGGGTA ACGACTTCTA 500 

CCACGAGATG ACTGACTCCA ACGTTCTGGA CAAAGTATCC TTGGTTTATG 550 

GCCAGATGAA TGAGC C AC CA GGTAACCGTC TTCGCGTTGC ACTGACCGGT 60 0 

40 CTGACCATGG CGGAGAAATT CCGTGATGAA GGTCGTGACG TATTGCTGTT 65 0 

CATCGATAAC ATCTATCGTT ATACCTTGGC CGGTACGGAA GTATC CGCAC 700 

TGCTGGGTCG TATGCCATCT GCGGTAGGCT ATCAGCCAAC GCTGGCAGAA 75 0 

GAGATGGGTG TGTTGCAGGA ACGTATTACT TCCAC CAAGA CGGGTTCAAT 800 

CA 802 

45 



2 ) INFORMATION FOR SEQ ID NO: 3 95 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia intermedia 
60 (B) STRAIN : ATCC 29909 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395 

GCTGTACCAA GAGTGTACAA CGCCCTTGAG GTTGAAGGCA CTGCTGAGAA 50 

5 GCTGGTGCTG GAAGTTCAGC AACAGCTAGG CGGTGGTGTT GTTCGTTGTA 10 0 

TCGCAATGGG CTCTTCCGAT GGTCTGAGCC GCGGGTTGAA AGTCATCAAC 15 0 

CTGGAACACC CAATTGAAGT GCCGGTTGGT AAATCAACTC TGGGCCGTAT 200 

CATGAACGTA TTGGGTGACC CAATCGACAT GAAAGGTCCT AT CGGTGAAG 250 

AAGAGCGTTG GGCAATCCAC CGCGAAGCGC CTTCTTACGA AGAGCTTGCC 3 00 

10 AGCTCACAAG ATTTGTTAGA AACCGGTATC AAAGTAATGG ACTTGATTTG 350 

CCCGTTCGCT AAGGGCGGTA AAGTGGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACAGT AAACATGATG GAGCTTATTC GTAACATCGC GATTGAGCAC 450 

TCAGGTTATT CTGTATTTGC TGGTGTGGGT GAGCGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAGATGA CTGACTCCAA CGTTCTGGAC AAAGTAT CCT 550 

15 TGGTGTATGG CCAGATGAAT GAGCCACCAG GTAACCGTCT GCGCGTTGCA 6 00 

CTGACCGGCC TGACCATGGC GGAGAAATTC CGTGATGAAG GTCGTGACGT 650 

ACTGTTGTTC ATCGATAACA TCTATCGCTA TACCTTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGTCGT ATGCCATCAG CGGTAGGCTA CCAGCCAACG 750 

CTGGCAGAAG AGATGGGTGT GTTGCAGGAA CGTATTACGT CCACCAAGAC 800 

20 GGGTTC 806 



2) INFORMATION FOR SEQ ID NO: 3 96 



25 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8 06 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

3 5 (A) ORGANISM: Yersinia pseudotuberculosis 

(B) STRAIN: ATCC 29833 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396 

4 0 GCTGTACCAA AAGTGTACAA CGCCCTTGAG GTAGAAGGCA CAACTGAAAA 50 

GTTAGTGCTG GAAGTTCAGC AACAGTTGGG CGGTGGTGTT GTTCGTTGTA 10 0 

TCGCAATGGG CTCTTCCGAT GGTCTGAGCC GTGGGTTGAA AGTAACCAAC 15 0 

CTGGAACACC CGAT CGAAGT ACCGGTTGGT AAAGCGACCC TTGGCCGTAT 20 0 

CATGAACGTA TTGGGTGAAC CAATCGACAT GAAAGGTCCT ATCGGTGAAG 250 

45 AAGAGCGTTG GGCAATCCAT CGCGAAGCGC CTTCTTATGA AGAGCTTGCT 300 

AGCTCACAAG ATC TGTTAGA AACCGGTATC AAGGTTATGG ACCTGATTTG 350 

TCCGTTTGCT AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACAGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

TCTGGGTATT CTGTATTTGC CGGTGTAGGT GAG CGTAC CC GTGAGGGTAA 500 

50 TGACTT CTAC CATGAAATGA CTGACTCCAA CGTTTTGGAC AAAGTAT C CT 55 0 

TGGTTTACGG CCAGATGAAT GAGCCACCAG GTAACCGTCT ACGCGTTGCA 600 

CTGACCGGCC TGACCATGGC GGAGAAATTC CGTGATGAAG GTCGTGACGT 65 0 

ACTGCTGTTC ATCGATAATA TCTATCGTTA TACCCTAGCT GGTACGGAAG 700 

TATCCGCATT GCTGGGTCGT ATGCCATCAG CGGTAGGTTA TCAGC CAAC A 75 0 

55 CTGGCTGAAG AGATGGGTGT GTTGCAGGAA CGTATTACTT CCACTAAGAC 80 0 

GGGTTC 806 



60 2) INFORMATION FOR SEQ ID NO: 397 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia rohdei 

(B) STRAIN: ATCC 43380 



15 



35 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397 



TTCCCCCAAG ACGCTGTACC AAAAGTGTAC AACGCCCTTG AGGTTGAAGG 50 

TGCAGCTGAG AAGCTTGTGC TGGAAGTTCA GCAGCAGCTG GGCGGTGGTG 100 

TTGTTCGTTG TATCGCAATG GGCTCTTCCG ATGGTTTGAG CCGTGGGTTG 15 0 

AAAGTTAT CA ACCTGGAACA CCCAATTGAA GTGCCAGTTG GTAAATCAAC 2 00 

2 0 TCTGGGCCGT ATCATGAACG TATTGGGTGA CCCAATCGAC ATGAAAGGCC 2 50 

CTATCGGTGA AGAAGAGCGT TGGGCAATCC ACCGTGAAGC GCCTTCTTAC 3 00 

GAAGAGC TTG CCAGCTCGCA AGATCTGTTA GAAACCGGTA TCAAGGTAAT 3 50 

GGATCTGATT TGTCCGTTCG CTAAGGGCGG TAAAGTCGGT CTGTTCGGTG 4 00 

GTGCGGGTGT TGGTAAAACA GTAAACATGA TGGAGCTTAT TCGTAACATC 450 

2 5 GCGATTGAGC ACTCAGGTTA TTCTGTATTT GCCGGGGTAG GTGAACGTAC 5 00 

TCGTGAGGGT AACGACTTCT ACCACGAGAT GACTGACTCC AACGTTCTGG 550 

ACAAAGTATC CTTGGTTTAT GGC CAGATGA ATGAGCCACC AGGTAACCGT 600 

CTGCGCGTTG CACTGACCGG CTTGACCATG GCGGAAAAAT TCCGTGATGA 650 

AGGCCGTGAC GTATTGCTGT TCATCGATAA CATTTATCGT TATACCCTAG 700 

3 0 CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCATC TGCGGTAGGC 750 

TATCAGCCAA CACTGGCAGA AGAGATGGGT GTGTTGCAGG AACGTATTAC 8 00 

TTCCACTAAG ACGGGTTCAA TCACCTCCG 82 9 



2) INFORMATION FOR SEQ ID NO: 3 98 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 819 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yokenella regensburgei 

(B) STRAIN: ATCC 35313 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398 

ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTACAAAA TGGTAACGAG 5 0 

AAACTGGTGC TGGAAGCTCA GCAGCAGCTC GGCGGCGGTA TCGTGCGTAC 10 0 

TATCGCCATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG GAAGTTAAAG 15 0 

5 5 ACCTCGAGCA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC CCTGGGCCGT 2 00 

ATCATGAACG TCCTGGGTCA GCCGATCGAC ATGAAAGGCG ACATCGGTGA 250 

AGAAGAGCGT TGGGCTATCC ACCGCGCAGC ACCTTCCTAT GAAGAGCTGT 300 

CCAGCTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTAAT GGATCTGATC 3 50 

TGCCCGTTCG CTAAGGGTGG TAAAGTCGGT CTGTTCGGTG GTGCGGGTGT 400 

6 0 AGGTAAAACT GTAAACATGA TGGAGCTTAT CCGTAACATC GCGATTGAGC 450 
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ACTCCGGTTA CTCTGTGTTT 
AACGACTTCT ACCACGAAAT 
CCTGGTGTAC GGCCAGATGA 
CGC TGACCGG CCTGACCATG 
5 GTTCTGCTGT TCGTCGATAA 
AGTATCCGCA CTGCTGGGTC 
CTCTGGCGGA AGAGATGGGT 
AC CGGTTCTA TCACCTCCG 



GCAGGCGTGG GTGAACGTAC 
GACCGACTCC AACGTACTGG 
ACGAGCCGCC GGGAAACCGT 
GCTGAGAAAT TCCGTGACGA 
CATCTACCGT TATACCCTGG 
GTATGCCTTC TGCGGTAGGT 
GTTCTTCAGG AACGTATCAC 



TCGTGAGGGT 500 

ATAAAGTATC 55 0 

CTGCGCGTTG 600 

AGGCCGTGAC 650 

CCGGTACGGA 700 

TATCAG CCAA 750 

CTCTACCAAA 8 0 0 
819 



2) INFORMATION FOR SEQ ID NO: 399 

(i) SEQUENCE CHARACTERISTICS : 
15 (A) LENGTH: 1097 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

2 0 (ii) MOLECULE TYPE: Genomic DNA 



25 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yarrowia lipolytics 

(B) STRAIN: ATCC 3 82 95 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399 



AAGCTTAAGG 
GAAGTTCCAG 
3 0 ACCGAGATTT 
GCCATCCTCA 
CAAGGAT GGT 
TTAAGCAGCT 
CAGGATCGAT 

3 5 GGTCGGTTAC 

ACGGTGACAA 
TGGACCAAGG 
GGC CATTGAT 
GACTTCCCCT 

4 0 GTTGGCCGAG 

CGCTCCCGCC 
AGATCCTCCC 
AAGAACGTTT 
CAAGAACGAC 
4 5 TTCTTAACCA 
TGCCACACTG 
CGACCGACGA 
CTGGTGATGC 
GAGGCCTTCA 

50 



CTGAGCGAGA 
ACCCCTAAGT 
CAT C AAGAAC 
TCATTGCTGG 
CAGACCCGAG 
CATTGTTGCT 
ACCTCGAGAT 
AACCCCAAGG 
CATGATCGAG 
AGACCAAGGC 
GCCATTGAGC 
CCAGGATGTC 
TCGAGACCGG 
AACGTGACCA 
CGATGGAGGT 
CCGTCAAGGA 
CCCCCCAAGG 
CCCCGGTCAG 
CCCACATTGC 
ACCGGTAAGA 
TGCCATTGTC 
CTGAGTACCC 



GCGAGGTATC 
ACTACGTCAC 
ATGATTACCG 
TGGTGTTGGT 
AGCACGCCCT 
ATCAACAAGA 
TTGCAAGGAG 
CTGTCCCCTT 
CCCTCTACCA 
CGGCGAGATC 
CCCCCGTGCG 
TACAAGATCG 
TGTTATCAAG 
CTGAGGTCAA 
TTCCCCGGTG 
TATCCGACGA 
GCTGCGACTC 
ATCGGTGCTG 
TTGCAAGTTC 
AGATGGAGGA 
AAGATGGTTC 
CCCTCTTGGT 



ACCATTGATA 
CGTTATTGAT 
GTACTTCCCA 
GAGTTCGAGG 
GCTCGCTTTC 
TGGACTCCGT 
ACTGCCAACT 
CGTCCCCATT 
ACTGTGACTG 
AAGGGTAAGA 
ACCCCACGAC 
GTGGTATCGG 
GCCGGTATGG 
GTCTGTCGAG 
ACAACGTCGG 
GGTAACGTTG 
TTTCAACGCT 
GTTACGCTCC 
GACACCCTGA 
CTCCCCCAAG 
CCTCCAAGCC 
CGATTCGCCG 



TCGCTCTCTG 5 0 

GCTCCCGGTC 10 0 

GGCTGACTGC 150 

CTGGTATCTC 2 00 

ACCCTCGGTG 2 50 

CAAGTGGTCT 3 00 

TCGTCAAGAA 350 

TCCGGATGGA 4 00 

GTACAAGGGA 45 0 

CCCTCCTCGA 50 0 

AAGCCCCTCC 550 

CACAGTGCCC 600 

KTGTTACCTT 65 0 

ATGCACCACG 70 0 

TTTCAACGTC 75 0 

CTGGTGACTC 8 00 

CAGGTCATTG 8 5 0 

CGTCCTTGAT 90 0 

TCGAGAAGAT 95 0 

TTCATCAAGT 100 0 

CATGTGTGTC 1050 

TCCGAGA 1097 



2) INFORMATION FOR SEQ ID NO: 400 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1233 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Absidia corymbifera 
5 (B) STRAIN: ATCC 46775 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400 

CAAGCTTAAG GCTGAACGTG AGCGTGGTAT CACCATCGAT ATCGCTCTCT 5 0 

10 GGAAGTTCGA GACTCCCAAG TACCACGTTA CCGTCATTGA TGCCCCTGGC 10 0 

CATCGT GAT T TCATCAAGAA CATGATTACT GGTACTTCCC AAGCTGACTG 150 

CGGTATCTTG ATTATTGCTG CTGGTACTGG TGAATTCGAA GCTGGTATCT 200 

CCAAGGATGG TCAAACCCGT GAACACGCTT TGCTTGCTTT CACCCTTGGT 2 50 

GTCCGTCAAT TGATTGTCGC TATCAACAAG ATGGATTCCA CCAAGTACTC 3 00 

15 TGAGGCCCGT TACAACGAAA TTGTCAAGGA AGTCTCCACC TTCATCAAGA 3 50 

AGATTGGTTT CAACCCCAAG TCCGTTCCTT TCGTCCCTAT CTCTGGCTGG 4 00 

AACGGTGACA ACATGTTGGA GGARTCCACC AACATGCCTT GGTTCAAGGG 4 50 

ATGGAACAAG GAGACTAAGG CTGGTGCCAA GACYGGCAAG ACCCTTCTTG 5 00 

AAGCCATTGA CAACATTGAT CCCCCTGTTC GTCCTTCCGA CAAGCCCCTT 550 

20 CGTCTTCCCC TTCAAGATGT CTACAAGATC GGTGGTATTG GTACAGTTCC 600 

TGTCGGTCGT GTTGAGACTG GTGTCATCAA GCCTGGTATG GTTGTCACCT 65 0 

TCGCTCCCGC TAACGTCACC ACTGAAGTCA AGTCCGTYGA AATGCACCAC 700 

GAGCAACTTG CTGAAGGTGT TCCCGGTGAC AACGTCGGTT TCAACGTCAA 75 0 

GAACGTTTCC GTCAAGGATA TCCGCCGTGG TAACGTYTGC TCTGACTCCA 800 

25 AGAACGACCC CGCCAAGGAA TCCGCTTCCT TCACCGCTCA AGTTATTGTC 850 

TTGAACCACC CTGGTCARAT TGGTGCTGGT TACTCTC CTG TCTTGGATTG 900 

CCACACTGCT CACATTGCAT GCAAGTTCTY TKAGCTTCTT KAGAAGATCG 950 

ATYGTCGTTC CGGTAAGTAA ATANTTTGGT TTRGGATATG GGTATTGGGC 1000 

TTAATCTYTG GATTTTGCCT CAATTGCTCC TTCCTTGATC TTTCTCGATT 1050 

3 0 ACTTTTTGAT CATTTGCTAA TCCAAACCCT TTCCATTTYA TTGAAAACAG 1100 

GTAAGAAGTT GGAAGACTCC CCCAAGTTCG TCAAGWSYGG TGACTCTGCY 1150 

ATCGTCAAGA TGGTTCCTTC CAAGCCCATG TGCGTTGAAG CCTACACTGA 12 0 0 

ATATCCTCCT CTTGGTCGTT TCGCTGTCCG TGA 123 3 



35 



50 



2) INFORMATION FOR SEQ ID NO: 401 



(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 1151 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

4 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Altexmaria alternata 

(B) STRAIN: ATCC 62099 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401 



CAAGTTGAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCTCTCT 50 

GGAAGTTCGA GACTCCCAAG GTTAGTACCC CTCTGCCTAC TACATCAAGT 100 

55 TCTTTACAAT GCTAACATGT TGTACTCAGT ACTATGTCAC CGTCATTGAC 150 

GCCCCCGGTC ACCGTGATTT CATCAAGAAC ATGATCACTG GTACCTCCCA 200 

GGCCGACTGC GCTATTCTCA TCATTGCCGC CGGTACTGGT GAGTTCGAGG 250 

CTGGTATCTC CAAGGATGGC CAGACTCGTG AGCACGCTCT CCTCGCTTAC 3 00 

ACCCTCGGTG TCAAGCAGCT CATCGTTGCC ATCAACAAGA TGGACACCAC 350 

60 CAAGTGGTCC GAGGAGCGTT ACCAGGAGAT CATCAAGGAG ACCTCCAACT 400 
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TCATCAAGAA GGTCGGCTAC AACCCCAAGC ACGTTCCCTT CGTCCCCATC 45 0 

TCCGGTTTCA ACGGTGACAA CATGATTGAG GCCTCATCCA ACTGCCCCTG 500 

GTACAAGGGT TGGGAGAAGG AGACCAAGGC CAAGGCCACT GGTAAGACCC 55 0 

TCCTCGAGGC CATCGACGCC ATCGACCCTY CCAGCCGTCC CACCGACAAG 600 

5 CCCCTCCGTC TTCCCCTYCA GGATGTTTAC AAGATTGGTG GTATTGGCAC 650 

GGTGCCCGTC GGTCGTGTCG AGACCGGTAT CATCAAGGCC GGTATGGTCG 700 

TCACCTTCGC CCCCGCTGGT GTCACCACTG AAGTCAAGTC CGTC GAGATG 750 

CACCACGAGC AGCTCACCGA GGGTGTCCCC GGTGACAACG TCGGCTTCAA 80 0 

CGTCAAGAAC GTCTCCGTCA AGGAGATCCG TCGTGGTAAC GTTGCCGGTG 850 

10 ACTCCAAGAA CGACCCCCCC AAGGGTGCCG AGTCCTTCAA CGCCCAGGTC 9 00 

ATCGTCCTCA ACCACCCTGG TCAGGTCGGT GCTGGTTACG CCCCAGTCCT 950 

CGACTGCCAC ACCGCCCACA TTGCTTGCAA GTTCTCTGAG CTCCTCGAGA 1000 

AGATTGACCG CCGTACCGGA AAGTCTGTTG AGAACTCTCC CAAGTTCATC 105 0 

AAGTCCGGTG ACGCCGCCAT CGTCAAGATG GTTCCCTCCA AGCCCATGTG 1100 

15 CGTTGAGGCT TTCACTGACT ACCCTCCTCT CGGTCGTTTC GCTGTCCGTG 1150 

A 1151 



2 0 2) INFORMATION FOR SEQ ID NO: 402 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1283 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus flavus 

(B) STRAIN: ATCC 26947 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402 



CAAGCTCAAG TCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCCTCT 50 

GGAAGTTCCA GACCTCCAAG TATGAGGTCA CCGTCATTGG TAAGCATTTG 100 

AGTTCCAACC TACGTTGCCC AACATTTACA GTCATCTAAC AAAGTT CAAT 15 0 

AGATGCCCCC GGTCACCGTG ACTTCATCAA GAACATGATC ACTGGTACTT 2 00 

40 CCCAGGCTGA CTGCGCTATC CTCATCATTG CCTCCGGTAC TGGTGAATTC 25 0 

GAGGCTGGTA TCTCCAAGGA TGGTCAGACC CGTGAGCACG CTCTGCTCGC 3 00 

TTTCACCCTC GGTGTCCGTC AGCTCATCGT TGCCCTCAAC AAGATGGACA 35 0 

CCTGCAAGTG GTCTCAGGAT CGTTACAACG AAATCGTTAA GGAGACTTCC 400 

AACTTCATCA AGAAGGTCGG ATACAACCCC AAGAGCGTTC CTTTCGTCCC 4 50 

45 CATCTCCGGT TTCAACGGTG ACAACATGAT TGAGGCCTCC ACCAACTGCC 50 0 

CCTGGTACAA GGGTTGGGAG AAGGAGACCA AGGCTGGCAA GTCCACCGGT 550 

AAGACCCTTC TCGAGGCCAT CGATGCCATC GAGCCCCCCG TCCGTCCCAC 60 0 

CGACAAGCCT CTCCGTCTTC CCCTYCAGGA TGTCTACAAG ATCTCTGGTA 650 

TCGGTACTGT GCCCGTCGGT CGTGTCGAGA CTGGTGTCAT CAAGC CTGGT 70 0 

50 ATGGTCGTTA CTTTCGCTCC TGCCAACGTG AC CAC TGAAG TCAAGTCCGT 750 
TGAAATGCAC CACCAGCAGC TCCAGGCCGG TAACCCCGGT GACAACGTTG 
GTTTCAACGT CAAGAACGTC TCCGTCAAGG AAGTCCGCCG TGGTAACGTT 
GCCGGTGACT CCAAGAACGA CCCCCCTGCT GGCTGCGATT CCTTCAACGC 

CCAGGTCATC GTCCTTAACC ACCCCGGTCA GGTCGGCAAC GGTTACGCTC 950 

55 CCGTCCTGGA CTGCCACACC GCTCACATTG CTTGCAAGTT CGCTGAGCTC 1000 

CTTGAGAAGA TTGACCGCCG TACCGGTAAA TCTGTTGAGG ACAAGCCCAA 1050 

GTTCATCAAG TCTGGTGATG CTGCCATCGT CAAGATGATT CCCTCCAAGC 1100 

CCATGTGTGT GGAGTCTTTC ACTGACTTCC CCCCTCTTGG TCGTTTCGCT 1150 

GTCCGTGACG TAAGTTTTTC CCTCTTGACT ATCTTCACAA TTTTTCACAT 12oS 

60 ATTTTCACGC CTCGTCCCAC TCTTTTTCCT CCCTTCCTCT TTGGTTCCCC 12sS 
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TTTTTGCCTG CAAGTTCTCT ATAGCTAACA TGA 12 8 3 



5 2) INFORMATION FOR SEQ ID NO: 403 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1103 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: DAL95 



20 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403 



TCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCCCTCT GGAAGTTCCA 50 

GACTCCCAAG TATGAGGTCA CTGTCATCGG TAAGCTCGAC TCGCCCCGAT 100 

ATGTTTTGGT GCTGTAGCTA ACACGATCTG AAGATGCCCC CGGTCACCGT 150 

GACTTCATCA AGAACATGAT CACTGGTACC TCCCAGGCTG ACTGCGCTAT 2 00 

25 CCTCATCATT GCCTCCGGTA CTGGTGAGTT CGAGGCTGGT ATCTCCAAGG 2 50 

ATGGCCAGAC CCGTGAGCAC GCTCTGCTGG CTTTCACCCT CGGTGTCAAG 3 00 

CAGCTCATCG TCGCCCTCAA CAAGATGGAC ACCTGCAAGT GGTC CGAGGA 3 50 

TCGTTACAAC GAAATTGTCA AGGAAACCTC CAACTTCATC AAGAAGGTCG 4 00 

GCTACAACCC CAAGGCCGTT CCCTTCGTCC CCATCTCTGG CTTCAACGGT 450 

3 0 GACAACATGC TTGAGC CCTC CTCCAACTGC CCCTGGTACA AGGGATGGGA 500 

GAAGGAGACC AAGGCCGGCA AGGTCACTGG TAAGACCCTC ATCGAGGCCA 550 

T CGACGC CAT TGAGCCCCCT GTCCGTCCCT CCAACAAGCC CCTCCGTCTT 600 

CCCCTCCAGG ATGTCTACAA GATCTCTGGT ATCGGAACGG TCCCTGTCGG 650 

CCGTGTCGAG ACCGGTATCA TCAAGCCCGG CATGGTCGTC ACCTTCGCCC 700 

3 5 CCGCCAACGT CACCACTGAA GTCAAGTCCG TCGAGATGCA CCACCAGCAG 750 

CTCCAGGAGG GTGTCCCCGG TGACAACGTC GGTTTCAACG TCAAGAACGT 8 00 

TTCCGTCAAG GAAGTCCGCC GTGGTAACGT CTGCGGTGAC TCCAAGAACG 850 

ATCCCCCTCA GGGTGCTGCC TCCTTCAACG CCCAGGTCAT CGTCCTCAAC 900 

CACCCCGGTC AGGTCGGCGC TGGTTACGCC CCCGTCCTCG ACTGCCACAC 950 

40 TGCC CACATT GCTTGCAAGT TCTCTGAGCT GCTTGAGAAG ATTGACCGCC 100 0 

GTACCGGCAA GTCTGTTGAG AACAACCCCA AGTTCATCAA GTCCGGTGAT 1050 

GCCGCCATCG TGAAGATGGT TCCTTCCAAG CCCATGTGTG TCGAGTCCTT 110 0 

CAC 1103 



2) INFORMATION FOR SEQ ID NO: 4 04 



(i) SEQUENCE CHARACTERISTICS: 
5 0 (A) LENGTH: 114 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: WSA-172 

60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 04 

AAGCTCAAGT CCGAGCGTGA GCGTGGTATC AC CAT CGAC A TTGCCCTCTG 50 

GAAGTTCCAG ACTCCCAAGT ATGAGGTCAC TGTCATCGGT AAGCTCGACT 100 

5 CGCCCCGATA TGTTTTGGTG CTGTAGCTAA CACGATCTGA AGATGCCCCC 15 0 

GGTCAC CGTG ACTTCATCAA GAACATGATC ACTGGTACCT CCCAGGCTGA 2 00 

CTGCGCTATC CTCATCATTG CCTCCGGTAC TGGTGAGTTC GAGGCTGGTA 2 50 

TCTCCAAGGA TGGCCAGACC CGTGAGCACG CTCTGCTGGC TTTCACCCTC 3 00 

GGTGTCAAGC AGCTCATCGT CGCCCTCAAC AAGATGGACA CCTGCAAGTG 350 

10 GTCCGAGGAT CGTTACAACG AAATTGTCAA GGAAACCTCC AACTTCATCA 400 

AGAAGGTCGG CTACAACCCC AAGGCCGTTC CCTTCGTCCC CATCTCTGGC 450 

TTCAACGGTG ACAACATGCT TGAGCCCTCC TCCAACTGCC CCTGGTACAA 500 

GGGATGGGAG AAGGAGACCA AGGCCGGCAA GGTCACTGGT AAGACCCTCA 550 

TCGAGGCCAT CGACGC CATT GAGCCCCCTG TCCGTCCCTC CAACAAGCCC 60 0 

15 CTCCGTCTTC CCCTCCAGGA TGTCTACAAG ATCTCTGGTA TCGGAACGGT 65 0 

CCCTGTCGGC CGTGTCGAGA C CGGTAT CAT CAAGCCCGGC ATGGTCGTCA 70 0 

CCTTCGCCCC CGCCAACGTC ACCACTGAAG TCAAGTCCGT CGAGATGCAC 750 

CACCAGCAGC TCCAGGAGGG TGTCCCCGGT GACAACGTCG GTTTCAACGT 8 00 

CAAGAACGTT TCCGTCAAGG AAGTCCGCCG TGGTAACGTC TGCGGTGACT 85 0 

2 0 CCAAGAACGA TCCCCCTCAG GGTGCTGCCT CCTTCAACGC CCAGGTCATC 900 

GTCCTCAACC ACCCCGGTCA GGTCGGCGCT GGTTACGCCC CCGTCCTCGA 95 0 

CTGCCACACT GCCCACATTG CTTGCAAGTT CTCTGAGCTG CTTGAGAAGA 10 0 0 

TTGACCGCCG TAC CGGCAAG TCTGTTGAGA ACAACCCCAA GTTCATCAAG 1050 

TCCGGTGATG CCGCCATCGT GAAGATGGTT CCTTCCAAGC CCATGTGTGT 1100 

25 CGAGTCCTTC ACTGACTACC CCCCTCTGGG TCGTTTCGCC GTCCGTGAC 114 9 



30 



2) INFORMATION FOR SEQ ID NO: 405 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1151 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Aspergillus niger 

(B) STRAIN: ATCC 9508 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405 

4 5 CAAGCTCAAG TCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCCCTCT 

GGAAGTTCCA GACTGGCAAG TATGAGGTCA CCGTCATTGG TATGTACTCA 
CAGAGTTCTC TTTTCATCAA AGCAATATAC TAACGTCCAT CATAGACGCC ISO 
CCCGGTCACC GTGACTTCAT CAAGAACATG ATCACTGGTA CCTCCCAGGC 2 00 

TGACTGCGCT ATCCTCATCA TTGCCTCCGG TAC TGGTGAG TTCGAGGCTG 25 0 

5 0 GTATCTCCAA GGATGGCCAG ACTCGTGAGC ACGCTCTGCT TGCTTTCACC 

CTCGGTGTCC GCCAGCTCAT CGTTGCCCTC AACAAGATGG ACACCTGCAA 
GTGGTC CGAG GACCGTTACA ACGAAATCGT TAAGGAGACC TCCAACTTCA 
TCAAGAAGGT CGGATACAAC CCCAAGGGTG TTCCTTTCGT CCCCATCTCC 
GGTTTCAACG GTGACAACAT GCTCGAGCCC TCCCCCAACT GCCCCTGGTA 
55 CAAGGGTTGG GAGAAGGAGA CCAAGGCCGG CAAGGTCACC GGTAAGACCC Xkcs 
TCCTTGAGGC CATCGACGCC ATCGAGCCCC CCGTCCGTCC CTC CAACAAG fiOO 
CCCCTCCGTC TTCCCCTCCA GGATGTCTAC AAGATCTCCG GTATTGGAAC III 
TGTTCCCGTC GGTCGTGTCG AGAC CGGTAT CATTGCCCCT GGTATGGTCG 700 

CXTHi f mr* , , -rv *~ — ™ - • - - - - - - 

800 



50 
100 



300 
350 
400 
450 



TGACCTTCGC TCCCGCCAAC GTCACCACTG AAGTCAAGTC CGTTGAGATG 7Sn 
60 CACCACCAGC AGCTCAAGGA AGGTGTCCCC GGTGACAACG TTGGTTTCaS 



374 



CA 02283458 1999-09-28 



10 



25 



CGTCAAGAAC GTTTCCGTCA AGGAGGTTCG CCGTGGTAAC GTTGCCGGTG 850 

ACTCCAAGAA CGACCCCCCT CTTGGCTGTG AGAGCTTCAC CGCCCAGGTC 900 

ATCGTCCTCA ACCACCCCGG TCAGGTCGGC GCTGGTTACG CTCCCGTCCT 950 

GGACTGCCAC ACTGCTCACA TTGCTTGCAA GTTCGCTGAG CTCCTTGAGA 100 0 

AGATTGACCG CCGTACCGGA AAGTCTGTTG AATCTTCCCC CAAGTTCATC 1050 

AAGTCCGGTG ACGCTGCCAT CGTCAAGATG ATTCCCTCCA AGCCCATGTG 1100 

TGTTGAGGCT TTCACTGACT ACCCCCCTCT TGGTCGTTTC GCCGTCCGCG 1150 

A 1151 



2) INFORMATION FOR SEQ ID NO: 406 



( i ) SEQUENCE CHARACTERISTICS : 
15 (A) LENGTH: 10 93 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

2 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastoschizomyces capitatus 

(B) STRAIN: ATCC 10663 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406 



GCTTAAAGCT GAACGTGAAC GTGGTATCAC CATTGATAT C GCTCTCTGGA 50 

AGTTCGAAAC TCCTAAGTAC TACGTTACTG TTATTGATG C TCCAGGTCAC 100 

3 0 CGTGATTTCA TCAAGAACAT GATTAC TGGT ACTTCCCAAG CCGATTGCGC 150 

CATTCTTATC ATTGCTGCCG GTGTCGGTGA ATTCGAAGCT GGTATCTCCA 200 

AGGAAGGTCA AACCAGAGAA CACGCTCTTC TCGCTTTCAC CCTTGGTGTC 25 0 

AGACAACTTA TCATTGCCAT CAACAAGATG GACTCTGTCA AGTGGGACCA 3 00 

AAAGAGATAC GAAGAAATCG TCAAGGAGGC TTCCAACTTC GTCAAGAAGG 350 

35 TTGGTTACAA CCCCAAGTCT GTTCCATTCG TTCCTATCTC TGGTTGGAAC 4 00 

GGTGACAACA TGTTGGAACC TACCACCAAC GCCCCATGGT ACAAGGGATG 45 0 

GACCAAGGAA ACCAAGGCTG GTGCCACTAA GGGTATGACT CTTATTGAAG 500 

CCATTGACGC CATTGAACCA CCAGTAAGAC CATCCGACAA GCCACTCCGT 550 

CTCCCACTCC AAGATGTTTA CAAGATTGGT GGTATCGGAA CTGTGCCAGT 600 

4 0 CGGCCGTGTC GAAACCGGTA TCATCAAGGC CGGTATGGTC GTTACCTTTG 650 

CTCCACCAAT GGTCACAACT GAAGTTAAGT CCGTTGAAAT GCACCACGAA 700 

CAACTTGCTC AAGGTAACCC AGGTGACAAC GTTGGTTTCA ACGTCAAGAA 750 

CGTTTCCGTT AAGGAAATCA GACGTGGTAA CGTCTGTGGT GACTCCAAGA 8 00 

ACGATCCACC AAAGGGCTGC GAATCTTTCA ACGCTCAAGT TATCGTCTTG 850 

45 AACCACCCTG GTCAAATCTC TGCTGGTTAC TCTCCAGTTC TCGATTGCCA 900 

CACTGCCCAC ATTGCCTGCA GATTCGACGA ACTCCTTGAA AAGATCGACC 950 

GTCGTTCTGG TAAGAAGATT GAAGACTCTC CAAAGTTTGT CAAGTCTGGT 10 0 0 

GATGCCGCTA TCGTCAAGAT GATCCCAACC AAGCCAATGT GCGTTGAAAC 10 50 

CTTCACTGAA TACCCACCAC TTGGTCGTTT CGCCGTCCGT GAT 10 93 

50 

2) INFORMATION FOR SEQ ID NO: 407 

55 { i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 10231D 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407 

C TTGGAC AAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 
CTTTGTGGAA ATTCGAAACT C C AAAAT AC C ACGTTACCGT CATTGATGCT 
CCAGGTCACA GAGATTTCAT CAAGAATATG ATCACTGGTA CTTCTCAAGC 
TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 
GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 
TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 
ATGGGACAAA AACAGATTTG AAGAAATCAT CAAGGAAACC TCCAACTTCG 
TCAAGAAGGT TGGTTACAAC C CAAAGACTG TTCCATTYGT TCCAATCTCT 
GGTTGGAATG GTGACAACAT GATTGAACCA TCCACCAACT GTCCATGGTA 
CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAAGTTACT GGTAAGACCT 
TGTTAGAAGC TATTGACGCT ATTGAACCAC CAACCAGACC AACCGACAAA 
CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 
TGTGCCAGTC GGTAGAGTTG AAACTGGTAT CATCAAAGCC GGTATGGTTG 
TTACTTTCGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 
CATCACGAAC AATTGGCTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 
CGTTAAGAAC GTTTCCGTTA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 
ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 
ATTGTTTTGA ACCATCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTCTT 
GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 
AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 
AAATC CGGTG ATGCTGCTAT CGTCAAGATG GTCCCAACCA AACCAATGTG 
TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 
A 



2) INFORMATION FOR SEQ ID NO: 408 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 18804 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408 

GAAGGCTGAA AGAGAAAGAG GTATCACCAT TGATATCGCT TTGTGGAAAT 
TCGAAACTCC AAAATACCAC GTTACCGTCA TTGATGCTCC AGGTCACAGA 
GATTTCATCA AGAATATGAT CACTGGTACT TCTCAAGCTG ATTGTGCTAT 
TTTGATTATT GCTGGTGGTA CTGGTGAATT CGAAGCCGGT ATTTCTAAGG 
ATGGTCAAAC CAGAGAACAC GCTTTGTTGG CTTACACTTT GGGTGTCAAA 
CAATTGATTG TTGCTGTCAA CAAGATGGAC TCTGTCAAAT GGGACAAAAA 
CAGATTT GAA GAAAT CAT CA AGGAAACCTC CAACTTCGTC AAGAAGGTTG 
GTTACAACCC AAAGACTGTT CCATTCGTTC CAATCTCTGG TTGGAATGGT 
GACAACATGA TTGAACCATC CACCAACTGT CCATGGTACA AGGGTTGGGA 
AAAGGAAACC AAATCCGGTA AAGTTACTGG TAAGACCTTG TTAGAAGCTA 
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TTGACGCTAT TGAACCACCA ACCAGACCAA 
CCATTGCAAG ATGTTTACAA GATCGGTGGT 
TAGAGTTGAA ACTGGTATCA TCAAAGCCGG 
CAGCTGGTGT TACCACTGAA GTCAAATCCG 
5 TTGGCTGAAG GTGTTCCAGG TGACAATGTT 
TTCCGTTAAA GAAATTAGAA GAGGTAACGT 
ATCCACCAAA GGGTTGTGAC TCTTTCAATG 
CATCCAGGTC AAATCTCTGC TGGTTACTCT 
TGCCCACATT GCTTGTAAAT TCGACACTTT 
10 GAACTGGTAA GAAATT GGAA GAAAATCCAA 
GCTGCTATCG TCAAGATGGT CCCAACCAAA 
CACTGACTAC CCACCATTAG GTAGATTCGC 



CCGACAAACC ATTGAGATTG 550 

ATTGGTACTG TGCCAGTCGG 600 

TATGGTTGTT ACTTTCGCCC 650 

TTGAAATGCA TCACGAACAA 7 00 

GGTTTCAACG TTAAGAACGT 750 

TTGTGGTGAC TCCAAGAACG 800 

CCCAAGTCAT TGTTTTGAAC 85 0 

CCAGTCTTGG ATTGTCACAC 900 

GGTTGAAAAG ATTGACAGAA 95 0 

AATTCGTCAA ATCCGGTGAT 1000 

CCAATGTGTG TTGAAGCTTT 105 0 

TGTCAGAGA 1089 



15 



2) INFORMATION FOR SEQ ID NO: 409 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1101 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida, albicans 

(B) STRAIN: ATCC 56884 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409 

CTTGGACAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAATATG ATCACTGGTA CTTCTCAAGC 150 

3 5 TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 2 00 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 2 50 

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 3 00 

ATGGGACAAA AACAGATTTG AAGAAATCAT CAAGGAAACC TCCAACTTCG 3 50 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 400 

40 GGTTGGAATG GTGACAACAT GATTGAACCA TCCACCAACT GTCCATGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAAGTTACT GGTAAGACCT 500 

TGTTAGAAGC TATTGACGCT ATTGAACCAC CAACCAGACC AACCGACAAA 55 0 

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGAT CGGTG GTATTGGTAC 60 0 

TGTGCCAGTC GGTAGAGTTG AAACTGGTAT CATCAAAGCC GGTATGGTTG 650 

45 TTACTTTCGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700 

CATCACGAAC AATTGGCTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

CGTTAAGAAC GTTTCCGTTA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

ATTGTTTTGA ACCATCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTCTT 900 

50 GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 95 0 

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 10 00 

AAATCCGGTG ATGCTGCTAT CGT CAAGATG GTCCCAACCA AACCAATGTG 1050 

TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100 

A 1101 

55 



60 



2) INFORMATION FOR SEQ ID NO: 410 
(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 
10 (B) STRAIN: ATCC 60193 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410 

CTTGGACAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 5 0 

15 CTTTGTGGAA ATTCGAAACT C CAAAATAC C ACGTTAC CGT CATTGATGCT 100 

C CAGGTC ACA GAGATTTCAT CAAGAATATG ATCACTGGTA CTTCTCAAGC 15 0 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 200 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 2 50 

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 3 00 

2 0 ATGGGACAAA AACAGATTTG AAGAAAT CAT CAAGGAAACC TCCAACTTCG 3 50 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 4 00 

GGTTGGAATG GTGACAACAT GATTGAACCA TCCACCAACT GTCCATGGTA 4 50 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAAGTTACT GGTAAGACCT 5 00 

TGTTAGAAGC TATTGACGCT ATTGAACCAC CAACCAGACC AACCGACAAA 55 0 

2 5 CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 6 00 

TGTGC CAGTC GGTAGAGTTG AAACTGGTAT CATCAAAGCC GGTATGGTTG 650 

TTACTTTCGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 70 0 

CATCACGAAC AATTGGCTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 7 50 

CGTTAAGAAC GTTTCCGTTA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

3 0 ACT C C AAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

ATTGTTTTGA ACCATCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTCTT 900 

GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950 

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 1000 

AAATCCGGTG ATGCTGCTAT CGTCAAGATG GTCCCAACCA AACCAATGTG 1050 

35 TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100 

AT 1102 



4 0 2) INFORMATION FOR SEQ ID NO : 411 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

<D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 90028 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411 



CTTGGACAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 5 0 

CTTTGTGGAA ATTCGAAACT CCAAAATAC C ACGTTAC CGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAATATG ATCACTGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 2 00 

60 GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 2 50 



378 



CA 02283458 1999-09-28 



TTGGGTGTCA 
ATGGGACAAA 
TCAAGAAGGT 
GGTTGGAATG 
5 CAAGGGTTGG 
TGTTAGAAGC 
C CAT TGAGAT 
TGTGCCAGTC 
TTACTTTCGC 

10 CATCACGAAC 
CGTTAAGAAC 
AC T C CAAGAA 
ATTGTTTTGA 
GGATTGTCAC 

15 AGATTGACAG 
AAATCCGGTG 
TGTTGAAGCT 
AT 



AACAATTGAT 
AACAGATTTG 
TGGTTACAAC 
GTGACAACAT 
GAAAAGGAAA 
TATTGACGCT 
TGCCATTGCA 
GGTAGAGTTG 
CCCAGCTGGT 
AATTGGCTGA 
GTTTCCGTTA 
CGATCCACCA 
ACCATCCAGG 
ACTGCCCACA 
AAGAACTGGT 
ATGCTGCTAT 
TTCACTGACT 



TGTTGCTGTC 
AAGAAATCAT 
CCAAAGACTG 
GATTGAACCA 
CCAAATCCGG 
ATTGAACCAC 
AGATGTTTAC 
AAACTGGTAT 
GTTACCACTG 
AGGTGTTCCA 
AAGAAATTAG 
AAGGGTTGTG 
TCAAATCTCT 
TTGCTTGTAA 
AAGAAATTGG 
CGTCAAGATG 
ACCCACCATT 



AACAAGATGG 
CAAGGAAACC 
TTCCATTCGT 
TCCACCAACT 
TAAAGTTACT 
CAACCAGACC 
AAGATCGGTG 
CATCAAAGCC 
AAGTCAAATC 
GGTGACAATG 
AAGAGGTAAC 
ACTCTTTCAA 
GCTGGTTACT 
ATTCGACACT 
AAGAAAATCC 
GTCCCAACCA 
AGGTAGATTC 



ACTCTGTCAA 3 00 

TCCAACTTCG 3 50 

TCCAATCTCT 400 

GTCCATGGTA 4 50 

GGTAAGACCT 500 

AACCGACAAA 550 

GTATTGGTAC 600 

GGTATGGTTG 650 

CGTTGAAATG 70 0 

TTGGTTTCAA 750 

GTTTGTGGTG 800 

TGCCCAAGTC 850 

CTCCAGTCTT 900 

TTGGTTGAAA 950 

AAAATTCGTC 1000 

AACCAATGTG 105 0 

GCTGTCAGAG 110 0 
1102 



20 



25 



30 



35 



2) INFORMATION FOR SEQ ID NO: 412 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida duhlinlensis 

(B) STRAIN: NCPF 3108 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412 



CTTGGATAAA 
CTTTGTGGAA 

4 0 CCAGGT CACA 

TGATTGTGCT 
GTATTTCTAA 
TTGGGTGTCA 
ATGGGACAAA 
45 TCAAGAAGGT 
GGTTGGAATG 
CAAGGGTTGG 
TGTTAGAAGC 
CCATTGAGAT 

5 0 TGTGCCAGTC 

TCACTTTTGC 
CATCACGAAC 
CGTTAAGAAT 
ACTC CAAGAA 

55 ATTGTCTTGA 
GGATTGTCAC 
AGATTGACAG 
AAATCCGGTG 
TGTTGAAGCT 

60 A 



TTGAAGGCTG 
ATTCGAAACT 
GAGATTTCAT 
ATTTTGATTA 
GGATGGTCAA 
AACAATTGAT 
AACAGATTTG 
TGGTTACAAC 
GTGACAACAT 
GAAAAGGAAA 
T ATTGATG CT 
TGCCATTGCA 
GGTAGAGTTG 
CCCAGCTGGT 
AATTGGCTGA 
GTTTCTGTCA 
CGATCCACCA 
ACCATCCAGG 
ACTGCCCACA 
AAGAACTGGT 
ACGCTGCTAT 
TTCACTGACT 



AAAGAGAAAG 
CCAAAATACC 
CAAGAACATG 
TTGCTGGTGG 
ACCAGAGAAC 
TGTTGCTGTC 
AAGAAATCAT 
CCAAAGACTG 
GATTGAAGCT 
CCAAATCCGG 
ATTGAACCAC 
AGATGTTTAC 
AAACTGGTGT 
GTTACCACTG 
AGGTGTTCCA 
AAGAAATTAG 
AAGGGTTGTG 
TCAAATCTCT 
TTGCTTGTAA 
AAGAAATTGG 
YGTCAAGATG 
ACCCACCATT 



AGGTATCACC 
ACGTTACCGT 
ATCACTGGTA 
TACTGGTGAA 
ACGCTTTGTT 
AACAAGATGG 
CAAGGAAACC 
TTCCATTCGT 
TCCACCAACT 
TAAGGTTACT 
CAACCAGACC 
AAGATCGGTG 
CATTAAAGCC 
AAGTCAAATC 
GGTGACAATG 
AAGAGGTAAC 
ACTCTTTCAA 
GCTGGTTACT 
ATTCGACACT 
AAGAAAATCC 
GTCCCAACCA 
AGGTAGATTC 



ATTGATATCG 50 

CATTGATGCT 10 0 

CTTCTCAAGC 150 

TTCGAAGCCG 2 0 0 

GGCTTACACT 250 

ACTCTGTCAA 3 00 

TCTAACTTCG 350 

TCCAATCTCT 400 

GTCCATGGTA 4 50 

GGTAAGACCT 500 

AACCGACAAA 550 

GTATTGGTAC 600 

GGTATGGTTG 650 

CGTTGAAATG 700 
TTGGTTTCAA 750 
GTTTGTGGTG 800 
TGCCCAAGTC 850 
CTCCAGTCTT 900 
TTGGTTGAAA 950 

AAAATTCGTC 1000 

AACCAATGTG 1050 

GCTGTCAGAG 1100 
1101 
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2) INFORMATION FOR SEQ ID NO: 413 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Candida, catenulata 

(B) STRAIN: ATCC 10565 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413 

20 GGACAAGCTT AAGGCTGAGC GTGAGAGAGG TATCACCATT GACATTGCCT 50 

TGTGGAAGTT CGAGACTCCC AAGTACCACG TCACTGTCAT TGACGCCCCC 100 

GGTCACAGAG ATTTCATCAA GAACATGATC ACTGGTACCT CGCAGGCTGA 150 

CTGTGCTATC TTGAT CATTG CTTCCGGTGT CGGTGAGTTC GAGGCTGGTA 200 

TCTCCAAGGA CGGTCAGACC CGTGAGCACG CCTTGTTGGC CTACACCTTG 250 

25 GGTGTCAAGC AGTTGATCGT TGCCATCAAC AAGATGGACT CCGTCAAGTG 3 00 

GGACAAGAAC AGATTCGAGG AGATTGTCAA GGAGACCACC AACTTCGTCA 350 

AGAAGGTTGG TTACAACCCC AAGGCTGTCC CCTTCGTCCC CATCTCTGGC 4 00 

TGGAACGGTG ACAACATGAT TGAGGCCTCC ACCAACTGCC CCTGGTACAA 450 

GGG CTGGGAG AAGGAGACCA AGGC CGGTAA GTCTACCGGT AAGACCTTGT 500 

3 0 TGGAGGCCAT TGACGCCATT GAGCCCCCTA CCAGACCCAC CGACAAGCCC 550 

TTGAGATTGC CCTTGCAGGA TGTCTACAAG ATCGGTGGTA TTGGTACGGT 600 

GCCCGTCGGC CGTGTCGAGA CCGGTGTCAT CAAGCCCGGT ATGGTCGTCA 650 

CCTTCGCCCC CGCTGGTGTC ACCAC TGAAG TCAAGTCCGT CGAGATGCAC 70 0 

C AC GAGCAGT TGTCCGAGGG TGTCCCCGGT GACAACGTTG GTTTCAACGT 750 

35 CAAGAACGTC TCTGTTAAGG AGATCAGACG TGGTAACGTC TGCGGTGACT 80 0 

CCAAGAACGA CCCCCCCATG GGTTGCTCTT CTTTCAACGC CCAGGTTATC 850 

GTGTTGAACC ACCCCGGTCA GATCTCTGCC GGTTACTCTC CCGTCTTGGA 900 

CTGCCACACC GCCCACATTG CTTGCAAGTT CGCCGAGTTG ATCGAGAAGA 950 

TTGACAGACG TACCGGTAAG TCCATGGAGG CTAACCCCAA GTTCGTCAAG 1000 

40 TCTGGTGACG CCGCCATCGT CAAGATGGAG CCCACTAAGC CCATGTGTGT 105 0 

TGAGGCCTTC ACTGACTTCC CTCCTTTGGG TAGATTCGCC GTCAGAGA 109 8 



45 2) INFORMATION FOR SEQ ID NO: 414 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubliniensis 

(B) STRAIN: NCPF 3 94 9 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414 

60 
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PTTGGATAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

ct??ctSS£ attcgaaact ccaaaatacc ACGTTACCGT CATTGATGCT 100 

SSotSSS SaGATTTCAT CAAGAACATG ATCACTGGTA CTTCTCAAGC 150 

SSttgtgct ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 200 

5 OTMTTCtS GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 2 50 

SSSgTCA SaCAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 300 

SSSSS ^EAGATTCG AAGAAATCAT CAAGGAAACC TCTAACTTCG 350 

TO^S^ TGOTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 4 00 

G^GGAATG GTGACAACAT GATTGAAGCT TCCACCAACT GTCCATGGTA 450 

in ?SgGGTTGG GAAAAGGAAA CCAAATCCGG TAAGGTTACT GGTAAGACCT 500 

TGTTAGAAGC TATTGATGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 

SS?S£SS TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600 

TGTGcSgTC GGTAGAGTTG AAACTGGTGT CATTAAAGCC GGTATGGTTG 650 

SaStTTTGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700 

15 SSSSS^ SaTTCGTTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

CGTTAAGAAT GTTTCTGTCA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

aStCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

ATTGTCTTGA ACCATCCAGG TCAAATCTCT GCTGGTTAYT CTCCAGTCTT 900 

GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950 

20 Sgattgacaq AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 1000 

AAATCCGGTG ACGCTGCTAT YGTCAAGATG GTCCCAACCA AACCAATGTG 1050 

TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100 

AT 1102 



25 



2) INFORMATION FOR SEQ ID NO: 415 



( i ) SEQUENCE CHARACTER I STI CS : 
3 0 (A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dublinlensis 

(B) STRAIN: CBS 7987 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415 

CTTGGATAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 5 0 

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

4 5 CCAGGTCACA GAGATTTCAT CAAGAACATG ATCACTGGTA CTTCTCAAGC 15 0 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 2 00 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250 

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 3 00 

ATGGGACAAA AACAGATTYG AAGAAATCAT CAAGGAAACC TCTAACTTCG 3 50 

50 TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 4 00 

GGTTGGAATG GTGACAACAT GATTGAAGCT TCCACCAACT GTCCATGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAGGTTACT GGTAAGACCT 500 

TGTTAGAAGC TATTGATGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 6 00 

55 TGTGCCAGTC GGTAGAGTTG AAACTGGTGT CATTAAAGCC GGTATGGTTG 650 

TCACTTTTGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 70 0 

CATCACGAAC AATTGGTTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

CGTTAAGAAT GTTTCTGTCA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

60 ATTGTCTTGA ACCATCCAGG TCAAATCTCT GCTGGTTAYT CTCCAGTCTT 90 0 
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GGATTGTCAC ACTGCCCACA 
AGATTGACAG AAGAACTGGT 
AAATCCGGTG ACGCTGCTAT 
TGTTGAAGCT TTCACTGACT 
5 AT 



TTGCTTGTAA ATTCGACACT 
AAGAAATTGG AAGAAAATCC 
YGTCAAGATG GTCCCAACCA 
ACCCACCATT AGGTAGATTC 



TTGGTTGAAA 950 
AAAATTCGTC 1000 
AAC CAATGTG 10 50 

GCTGTCAGAG 110 0 

1102 



10 



2) INFORMATION FOR SEQ ID NO: 416 



( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 1094 bases 

(B) TYPE: Nucleic acid 

( c ) STRANDEDNE S S : Doub 1 e 
15 (D) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
2 0 (A) ORGANISM: Candida famata 

(B) STRAIN: ATCC 628 94 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416 

25 AATTGAAGGC TGAAAGAGAA AGAGGTATCA CCATTGATAT CGCTTTATGG 50 

AAATT C G AAA CTCCAAAATA CCACGTTACC GTTATTGATG CTCCAGGTCA 100 

CAGAGATTTC AT C AAGAACA TGATTACTGG TACTTCTCAA GCTGATTGTG 15 0 

CTATTTTRAT TATTGCTGGT GGTGTCGGTG AATTCGAAGC CGGTATCTCT 2 00 

AAGGATGGTC AAAC CAGAGA ACACGCTTTA TTGGCTTACA CCTTAGGTGT 2 50 

30 TAGACAATTG ATTGTTGCCG TCAACAAGAT GGACTCTGTT AAATGGGACA 3 00 

AGGCTAGATT CGAAGAAATC ATCAAGGAAA CCTCTAACTT CGTCAAGAAG 350 

GTTGGTTACA ACCCTAAGAC TGTTCCTTTC GTYCCAATTT CTGGATGGAA 4 00 

CGGTGACAAC ATGATTGAAG CCTCCACCAA CTGTC CAT GG TACAAGGGTT 4 50 

GGGAAAAGGA AACCAAGGCT GGTAAATCTA CTGGTAAGAC TTTGTTAGAA 500 

35 GCCATTGATG C CAT TGAACC ACCAACCAGA CCAACCGAAA AGCCATTGAG 550 

ATTACCATTA CAAGATGTCT AC AAGAT CGG TGGTATTGGT ACTGTGCCAG 6 00 

TCGGTAGAGT TGAAACCGGT GTTATCAAGG GTGGTATGGT TGTTACCTTT 650 

GCCCCAGCCG GTGTCACTAC CGAAGTCAAA TCCGTTGAAA TGC AC CACGA 700 

ACAATTAGCT GAAGGTGTTC CAGGTGACAA TGTTGGTTTC AACGTCAAGA 75 0 

4 0 ACGTTTCCGT TAAGGAAATC AGAAGAGGTA ACGTTTGTGG TGACTC CAAG 80 0 

AACGACCCAC CAAAGGGTGC TGAATCTTTC ACCGCTCAAG TTATTGTCTT 85 0 

GAACCACCCA GGTCARATCT CTGCTGGTTA CTCTCCAGTC TTAGATTGTC 90 0 

ACACCGCCCA CATTGCTTGT AAATTCGATG CTTTACTCGA AAAGATTGAC 950 

AGAAGATCCG GTAAGAAATT AGAAGACGAA CCAAAATTCG TCAAGTCCGG 10 0 0 

45 TGATGCTGCT ATCGTCAAGA TGGTCCCAAC CAAACCAATG TGTGTTGAAG 10 50 

CTTTCACTGA ATACCCACCA TTAGGTAGAT TCGCTGTTAG AGAT 10 94 



5 0 2) INFORMATION FOR SEQ ID NO: 417 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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30 



45 



2) INFORMATION FOR SEQ ID NO: 418 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 1080 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

4 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida guilliermondii 

(B) STRAIN: ATCC 626 0 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418 



(A) ORGANISM: Candida glabra ta 

(B) STRAIN: ATCC 66032 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417 

5 TTTGGACAAG TTGAAGGCTG AAAGAGAAAG AGGTATCACT ATCGATATCG 50 

CTTTGTGGAA GTTCGAAACT CCAAAGTACC ACGTYACCGT TATCGATGCY 100 

SIgGt2aCA GAGATTTCAT CAAGAACATG ATTACTGGTA CTTCTCAAGC 150 

SSottoct Scttgatta TTGCTGGTGG TGTCGGTGAA TTCGAAGCYG 200 

10 gS?S?2S ^ATGGTCAA ACCAGAGAAC ACGCTCTATT GGCTTTCACC 250 

CTAGGTGTTA GACAAT TG AT TGTYGCTGTC AACAAGATGG ACTCTGTCAA 300 

CTGGGATGAA TCCAGATTCG CTGAAATCGT TAAGGAAACC TCCAACTTCA 3 50 

SXg^GGT CGGTTACAAC CCAAAGACTG TTCCATTCGT CCCAATCTCT 400 

SSSSScS GTGACAACAT GATTGAAGC C ACCACCAACG CTTCCTGGTA 450 

15 CAAGGGTTGG GAAAAGGAAA CCAAGGCTGG TGTCGTCAAG GGTAAGACCT 5 00 

TGTTGGAAGC CATTGACGCT AT CGAAC C AC CAACCAGACC AACTGACAAG 550 

CCATTGAGAT TGCCATTGCA AGATGTCTAC AAGATCGGTG GTATCGGTAC 6 00 

GGTGCCAGTC GGTAGAGTCG AAACCGGTGT CATCAAGCCA GGTATGGTTG 6 50 

TTACCTTCGC CCCAGCTGGT GTTACCACTG AAGTCAAGTC CGTTGAAATG 7 00 

2 0 CACCACGAAC AATTGACTGA AGGTTTGCCA GGTGACAACG TTGGTTTCAA 750 

CGTTAAGAAC GTTT CCGTTA AGGAAATCAG AAGAGGTAAT GTCTGTGGTG 8 00 

ACTCCAAGAA CGACCCACCA AAGGCTGCTG CTTCTTTCAA CGCTACCGTC 850 

ATTGTCTTGA ACCACCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTTTT 900 

GGACTGTCAC ACCGCCCACA TTGCTTGTAA GTTC GAAGAA TTGTTGGAAA 95 0 

2 5 AGAACGACAG AAGATCCGGT AAGAAGTTGG AAGACTCTCC AAAGTTCTTG 1000 

AAGTCCGGTG ACGCTGCTTT GGTTAAGTTC GTTCCATCCA AGCCAATGTG 1050 

TGTCGAAGCT TTCTCCGACT ACCCACCATT GGGTAGATTC GCTGTCAGAG 110 0 



AGAGAAAGAG GTATCACCAT TGACATTGCT TTGTGGAAAT TCGAGACTCC 50 

AAAGTAC CAC GTTACYGTCA TTGATGCCCC AGGTCACAGA GATTTCATCA 100 

50 AGAACATGAT CACTGGTACT TCTCAAGCTG ACTGTGCTAT TTTGATTATT 150 

GCTGGTGGTA CCGGTGAATT CGAAGCTGGT ATCTCTAAGG ATGGTCAAAC 2 00 

CAGAGAGCAC GCTTTGTTGG CTTACACCTT GGGTGTTAGA CAATTGATTG 2 50 

TTGCTGTCAA CAAGATGGAC TCCGTCAART GGGACAAGAA CAGATTYGAG 3 00 

GAAATCATCA AGGAAACCTC TAACTTCGTC AAGAAGGTTG GTTACAACCC 350 

55 TAAGACTGTG CCATTCGTTC CTATCTCTGG ATGGAAYGGT GACAACATGA 4 00 

TTGAGGCTTC TACCAACTGT CCTTGGTACA AGGGATGGGA GAAGGAGACC 45 0 

AAGGCTGGTA AGTCCACCGG TAAGACTTTG TTGGAGGCCA TTGACGC CAT 500 

TGAGCCACCT CAAAGACCAA CCGACAAGCC ATTGAGATTG CCATTGCAAG 550 

ATGTYTACAA GATTGGTGGT ATTGGAACGG TGCCAGTCGG TAGAGTTGAA 600 

60 AC CGGTATCA TYAAGGCCGG TATGGTTGTT ACCTTTGCCC CAGCTGGTGT 650 
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YACCACTGAA GTCAAGTCCG TGGAAATGCA CCACGAACAA TTGGTTGAAG 70 0 

GTGTTCCAGG TGACAATGTT GGTTTCAACG TTAAGAACGT TTCCGTTAAG 750 

GAAATTAGAA GAGGTAACGT TTGTGGTGAC TCCAAGAACG ACCCACCAAA 800 

GGGTTGTGAC TCTTTCACCG CTCAAGTTAT TGTGTTGAAC CACCCTGGTC 850 

5 AAATCTCTGC TGGTTACTCT CCAGTTTTGG ACTGTCACAC CGCCCACATT 900 

GCTTGTAAAT TCGACACCTT GTTGGAGAAG ATTGACAGAA GAACCGGTAA 950 

GAAGATGGAG GACAACCCCA AGTTTGTCAA GTCCGGTGAC GCTTCTATCG 1000 

TCAAGATGGT GCCATCCAAG CCAATGTGTG TTGAGGCTTT CACCGACTAC 1050 

CCACCATTGG GAAGATTCGC CGTCAGAGAC 108 0 

10 



2) INFORMATION FOR SEQ ID NO: 419 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida haemulonii 
25 (B) STRAIN: ATCC 22991 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419 

TCTGTCAAGT GGGACAAGGC CAGATACGAG GAAATCGT CA AGGAGACCTC 50 

30- TAACTTCGTC AAGAAGGTTG GTTACAACCC TAAGACTGTT CCATTCGTCC 100 

CAATCTCTGG TTGGAACGGT GACAACATGA TTGAGGCTTC TACCAACTGT 150 

GAC TGGTACA AGGGTTGGGA GAAGGAGACC AAGTCTGGTA AGTCCACCGG 200 

TAAGAC CTTG TTGGAGGCCA TTGACGCCAT TGAGCCACCA ACCAGACCAA 250 

CCGACAAGCC ATTGAGATTG CCATTGCAGG ATGTCTACAA GATTGGTGGT 3 00 

3 5 ATCGGAACTG TGCCAGTCGG CAGAGTTGAG ACCGGTGTTA TCAAGGCCGG 3 50 

TATGGTTGTC ACCTTCGCCC CAGCTGGTGT CACCACTGAA GTCAAGTCTG 4 00 

TCGAGATG C A CCACGAGCAG TTGC CAGAGG GTGTCCCAGG TGACAACGTT 450 

GGTTTCAACG TCAAGAACGT TTCCGTTAAG GAAATCAGAA GAGGTAACGT 500 

CTGTGGTGAC TCCAAGCAGG ACCCACCAAA GGG CTGTGAC TCTTTCACCG 550 

40 CTCAGGTTAT TGTGTTGAAC CACCCAGGTC AGATCTCTTC TGGTTACTCT 60 0 

CCAGTTTTGG ACTGTCACAC TGCCCACATT GCTTGTAAGT TCGACACCTT 650 

GGTTGAGAAG ATCGACAGAA GAACCGGTAA GAAGTTGGAA GATGAGCCAA 700 

AGTTCATCAA GTCCGGTGAC GCTGCTATCG TCAAGATGGT CCCAACCAAG 750 

C 751 

45 



2) INFORMATION FOR SEQ ID NO: 420 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1102 bases 

(B) ' TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida inconspicua 
60 (B) STRAIN: ATCC 16783 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420 

TCTTGACAAG TTAAAGGCTG AAAGAGAAAG AGGTATCACT ATTGATATTG 5 0 

5 CTTTATGGAA ATTCGAAACT CCAAAGTATC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT TAAGAACATG ATTACTGGTA CTTCTCAAGC 15 0 

AGATTGTGCT ATTTTGATTA TTGCTGGTGG TGTCGGTGAA TTCGAAGCTG 2 00 

GTATTTC CAA GGATGGTCAA ACTAGAGAAC ACGCTTTATT AGCATTCACC 2 50 

TTAGGTGTTA AGCAATTGAT TGTTGCTATC AACAAGATGG ATTCTGTTAA 3 00 

10 GTGGGATGAA AAGAGATTTG AAGAAATTGT CAAGGAAAC C CAAAACTTCA 350 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATTTCT 4 00 

GGTTGGAATG GTGACAACAT GATTGAACCA TCTTCTAACT GTCCATGGTA 450 

CAAGGGTTGG ACTAAGGAAA CCAAGGCAGG TGTTGTCAAG GGTAAGACCT 500 

TATTAGAAGC TATTGATGCT ATTGAACCAC CTGTCAGACC AACTGATAAG 550 

15 CCATTAAGAT TACCATTACA AGATGTTTAC AAGATTGGTG GTATTGGTAC 60 0 

TGTGCCAGTC GGTAGAGTTG AAAC CGGTAT TATTAAGCCA GGTATGGTTG 650 

TTGTTTTCGC ACCATCTGGT GTTACCACTG AAGTCAAGTC CGTTGAAATG 700 

CACCATGAAC AATTAGAAGA AGGTGTCCCA GGTGACAATG TTGGTTTCAA 750 

CGTCAAGAAC GTCTCTGTTA AGGAT AT CAA GAGAGGTAAC GTTTGTGGTG 800 

2 0 AC TCCAAGAA TGACCCACCA CAAGGTTGTG CTTCCTTCAA TGCTCAAGTC 850 

ATTGTCTTGA ACCACCCTGG TCAAATTTCT GCTGGTTACT CTCCAGTTTT 900 

AGATTGTCAC ACTGCCCACA TTGCATGTAA ATTCGATGAA TTAATTGAAA 950 

AGATTGACAG AAGAACTGGT AAGTCCGTTG AAGACCATCC AAAGTCTGTT 1000 

AAGTCTGGTG ATGCAGCTAT CGTTAAGATG GTTCCAACCA AGCCAATGTG 1050 

2 5 TGTTGAAGCT TTCACTGAAT ACCCACCATT AGGTAGATTC GCAGTCAGAG 1100 

AT 1102 



30 2) INFORMATION FOR SEQ ID NO: 421 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1099 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

( D ) TO POLOG Y : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida kefyr 

(B) STRAIN: ATCC 28838 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421 



TGGACAAGTT AAAGGCTGAA AGAGAAAGAG GTATCAC CAT CGATATCGCT 50 

TTGTGGAAGT TCGAAACTCC AAAGTAC CAA GTTACCGTTA TCGATGCTCC 10 0 

AGGTCACAGA GATTTCATCA AGAACATGAT TACTGGTACT TCTCAAGCTG 150 

ACTGTGCTAT CTTGATTATT GCTGGTGGTG TCGGTGAATT CGAAGCCGGT 2 00 

5 0 ATCTCCAAGG ATGGTCAAAC CAGAGAACAC GCTTTGTTGG CTTACACCTT 250 

GGGTGTTAGA CAATTGATTG TTGCTATCAA CAAGATGGAC TCTGTTAAGT 3 00 

GGGATGAATC TCGTTAC CAA GAAATTGTTA AGGAAACCTC CAACTTCATC 3 50 

AAGAAGGTCG GTTACAACCC AAAGAATGTT CCATTCGTCC CAATCTCTGG 4 00 

TTGGAACGGT GACAACATGA TTGAAGCCAC CACCAACGCT C C ATGGTAC A 450 

55 AGGGTTGGGA AAAGGAAACC AAGGCTGGTA CCGTCAAGGG TAAGAC CTTG 500 

TTGGAAGCTA TTGACGCTAT CGAACCACCA AC C AGAC CAA CTGACAAGCC 55 0 

ATTGAGATTG CCATTGCAAG ATGTCTACAA GATCGGTGGT ATTGGTACTG 60 0 

TGCCAGTCGG TAGAGTCGAA ACCGGTGTCA TCAAGC CAGG TATGGTTGTT 650 

ACCTTCGCCC CAGCCGGTGT CACTACCGAA GTTAAGTCCG TCGAAATGCA 700 

6 0 CCACGAACAA TTGGAAGAAG GTCTACCAGG TGACAACGTC GGTTTCAACG 750 



385 



CA 02283458 1999-09-28 



TCAAGAACGT 
TCCAAGAACG 
CGTCTTGAAC 
ATTGTCACAC 
AACGACAGAA 
GTCTGGTGAC 
TTGAAGCATT 



TTCCGTTAAG 
ATCCACCAAA 
CACCCAGGTC 
TGCTCACATT 
GATCCGGTAA 
GCTGCTTTGG 
CTCTGACTAC 



GAAATCAGAA 
GGCTGCTGCT 
AAATCTCTGC 
GCTTGTAAGT 
GAAGTTGGAA 
TTAAGTTCGT 
CCACCATTGG 



GAGGTAACGT 
TCTTTCAACG 
TGGTTACTCT 
TCGACGAATT 
GACTCTCCAA 
TCCATCTAAG 
GTAGATTCGC 



CTGTGGTGAC 8 00 

CCACTGTTAT 850 

CCAGTTTTGG 900 

GTTGGAAAAG 950 

AGTTCTTGAA 10 0 0 

CCAATGTGTG 1050 

TGTCAGAGA 1099 



10 



15 



20 



25 



30 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 422 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1095 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNE S S : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida krusei 

(B) STRAIN: ATCC 34135 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422 



AAGTTAAAGG 
GAAGTTYGAA 
ACAGAGATTT 
GCTATTTTGA 
CAAGGATGGT 
TTAGACAATT 
GAAAACAGAT 
GGTTGGTTAC 
ATGGTGACAA 
TGGAC TAAGG 
AGCAATCGAT 
GATTACCATT 
GTCGGTAGAG 
TGCTCCAGCA 
AACAATTAGA 
AACGTTTCTG 
GAACGACCCA 
TGAACCACCC 
CACACTGCCC 
CAGAAGAACT 
GTGATGCAGC 
GCTTTCACTG 



CAGAAAGAGA 
ACTCCAAART 
CATCAAGAAC 
TTATTGCTGG 
CAAACTAGAG 
GATTGTTGCT 
TTGAAGAAAT 
AACCCAAAGA 
CATGATTGAA 
AAAC CAAGGC 
GCTATTGAAC 
ACAAGATGTT 
TCGAAACCGG 
GGTGTCACCA 
ACAAGGTGTT 
TCAAGGATAT 
CCAATGGGTG 
TGGTCAAATT 
ACATTGCATG 
GGTAAGTCTG 
TATCGTCAAG 
AATAYCCACC 



AAGAGGTATC 
ACCACGTTAC 
ATGATTACCG 
TGGTGTCGGT 
AACACGCTCT 
AT CAAC AAGA 
TGTCAAGGAA 
CTGTTCCATT 
GCATCCACCA 
AGGTGTTGTT 
CACCTGTCAG 
TACAAGATTG 
TGTCATTAAG 
CCGAAGTCAA 
CCAGGTGATA 
CAAGAGAGGT 
CAGCTTCCTT 
TCCGCTGGTT 
TAAGTTC GAC 
TTGAAGACCA 
ATGGTCCCAA 
ATTAGGTAGA 



ACTATTGATA 
CGTTATTGAT 
GTACTTCTCA 
GAATT CGAAG 
ATTGGCTTTC 
TGGATTCCGT 
ACCCAAAACT 
CGTTC CAATY 
ACTGTCCATG 
AAGGGTAAGA 
ACCAACCGAA 
GTGGTATTGG 
CCAGGTATGG 
GTCCGTTGAA 
ACGTTGGTTT 
AACGTTTGTG 
CAATGCTCAA 
ACTCTCCAGT 
GAATTAAT CG 
TCCAAAGTCY 
CCAAGCCAAT 
TTCGCAGTCA 



TTGCTTTATG 5 0 

GCTCCAGGTC 100 

AGCTGATTGT 15 0 

C TGGTATCTC 200 

AC CTTAGGTG 250 

TAARTGGGAT 3 0 0 

TCATCAAGAA 350 

TCTGGTTGGA 4 00 

GTACAAGGGT 4 50 

CCTTATTAGA 5 00 

AAGCCATTAA 55 0 

TACTGTGC C A 600 

TTGTCACTTT 65 0 

ATGCACCATG 70 0 

CAACGTTAAG 75 0 

GTGACTCCAA 80 0 

GTCATTGTCT 85 0 

CTTGGATTGT 90 0 

AAAAGATTGA 950 

GTCAAGTCTG 10 00 

GTGTGTTGAA 1050 

GAGAT 1095 



50 



55 



2) INFORMATION FOR SEQ ID NO: 423 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1104 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



60 



(ii) MOLECULE TYPE: Genomic DNA 
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( vi ) ORIGINAL SOURCE : 

(A) ORGANISM: Candida lambxca 

(B) STRAIN: ATCC 24750 

° (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423 

PTTGGACAAG CTTAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG SO 

CTTTATGGAA GTTCGAAACT CCAAAGTACC ACGTTACCGT CATTGACGCT 100 

m PPAGGTCACA GAGATTTCAT CAAGAACATG ATTACTGGTA CCTCTCAAGC 150 

aga??gtcS ATTTTRATYA TTGCTGGTGG TGTCGGTGAA TTCGAAGCTG 200 

g"ta?ct?taa gStggtcaa ACCAGAGAAC ACGCTCTTCT TGCATTCACT 250 

CTTGGTGTTA GACAATTGAT TGTTGCTATC AACAAGATGG ACTCTGTCAA 300 

SgSaSaA TCCAGATTCG ATGAAATTTG TAAGGAAACC GCWAACTTCA 3 50 

15 5JAAGAJSS tSgTTACAAC CCAAAGACTG TTCCATTCGT CCCAATCTCT 400 

GGTTGGAACG GTGACAACAT GATTGAACCA TCTGCTAACT GTCCATGGTA 450 

CAAGgSt"gG ACTAAGGAAA CCAAGGCTTC CGGTGTCGTC AAGGGTAAGA 500 

CCCTTCTTGA AGCAATTGAT GCTATTGAGC CACCTGTCAG ACCAACTGAC 550 

AAG^SgA GATTGCCATT RCAAGATGTC TACAAGATTG GTGGTATTGG 600 

?Sc?G?GCCA GTCGGTAGAG TTGAAACCGG TATCATCAAG CCAGGTATGA 650 

ggJSSSS ScTCCAACC GGTGTTACTA CTGAAGTTAA GTCCGTTGAA 700 

ATrrACCATG AACAATTAGA AGAAGGTGTC CCAGGTGACA ATGTTGGTTT 7 50 

CAaS53£g AACGTCTCTG TTAAGGATAT TAAGAGAGGT AACGTCTGTG 800 

S?Sac?CCAA GAACGACCCA CCAATGGGTT GTGCTTCCTT CAATGCTCAA 850 

G^StTGTTC TTAACCACCC AGGTCAAATT TCTGCTGGTT ACTCACCAGT 900 

TCTTGACTGT CACACTGCCC ACATTGCATG TAAGTTCGAY GAATTA.CTCG 950 

AAAAGATTGA CAGAAGAACC GGTAAGGCTA CTGAAGAC CA TCCAAAGTCT 1000 

GTCAAGTCTG GTGATGCAGC TATCGTCAAG ATGGTTCCAA CCAAGCCAAT 1050 

GTGTGTYGAA GCTTTCACTG ACTACCCACC ATTAGGTAGA TTCGCTGTYA 1100 
30 GAGA 



20 



25 



2 ) INFORMATION FOR SEQ ID NO: 424 

35 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 98 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (a) ORGANISM: Candida lusitaniae 

(B) STRAIN: ATCC 66035 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424 

50 GGACAAGTTG AAGGCTGAGA GAGAAAGAGG TATCACCATC GATATCGCTT 5 0 

TGTGGAAGTT CGAGACTCCA AAGTACCACG TTACCGTCAT TGACGCTCCA 100 

GGTCACAGAG ATTTCATCAA GAACATGATC ACTGGTACTT CCCAAGCTGA 150 

CTGTGCTATC TTGATTATCG CTGGTGGTGT CGGTGAGTTC GAAGCCGGTA 200 

TCTCTAAGGA CGGTCAAACC AGAGAGCACG CTTTGTTGGC TTACACCTTG 250 

55 GGTGTCAAGC AGTTGATTGT TGCTGTCAAC AAGATGGACT CCGTCAAGTG 3 00 

GGACCAATCT AGATTCGAGG AAATCATCAA GGAAACCTCT AACTTCGTCA 350 

AGAAGGTTGG TTACAACCCT AAGACTGTTC CATTCGTCCC AATCTCTGGT 400 

TGGAACGGTG ACAACATGAT TGAGCCATCY ACCAACTGCC CATGGTACAA 450 

GGGTTGGGAG AAGGAGACCA AGTCYGGTAA GTCCACCGGT AAGACCTTGT 500 

60 TGGAGGCCAT TGACGC CATT GAGCCACCTT CGAGACCAAC CGACAAGCCA 550 
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TTGAGATTGC CATTGCAAGA 
GCCAGT CGGT AGAGTTGAGA 
CCTTTGCTCC AGCTGGTGTC 
CACGAACAAT TGGCTGAGGG 
5 CAAGAACGTT TCCGTCAAGG 
C C AAGAACGA CCCACCAAAG 
GTCTTGAACC ACCCAGGTCA 
CTGTCACACT GCYCACATTG 
TCGACAGAAG AACYGGTAAG 
10 TCYGGTGACG CTGCTATCGT 
YGAAGCTTTC ACCGACTACC 



TGTCTACAAG ATYGGTGGTA 
CCGGTGTCAT CAAGGCCGGT 
ACCACTGAAG TCAAGTCCGT 
TGTCCCAGGT GACAACGTTG 
AAATCAGAAG AGGTAACGTC 
GCTGCTGCTT CYTTCACTGC 
AATCTCCTCY GGTTACTCTC 
CTTGTAAGTT CGACAC CTTG 
AAGTTGGAAG AAG AGC CAAA 
CAAGATGGTC CCAACCAAGC 
CACCATTGGG TAGATTCGCT 



TTGGTACTGT 6 0 0 

ATGGTTGTCA 650 

GGAAATGCAC 70 0 

GTTTCAACGT 75 0 

TGTGGTGACT 800 

TCAAGTYATY 850 

CAGTYTTGGA 90 0 

ATTGAGAAGA 95 0 

GTTCATCAAG 10 0 0 

CAATGTGYGT 105 0 

GTCAGAGA 1098 



15 2) INFORMATION FOR SEQ ID NO: 425 
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50 



( i ) SEQUENCE CHARACTER I STICS : 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida norvegensis 

(B) STRAIN: ATCC 22977 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425 

TCTTGACAAG TTAAAGGCTG AAAGAGAAAG AGGTATCACT 
CTTTATGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT 
CCAGGTCACA GAGATTTCAT TAAGAACATG ATTACTGGTA 
TGATTGTGCT ATCT TAATTA TTGCTGGTGG TGT CGGTGAA 
GTATCTCCAA GGATGGTCAA ACCAGAGAAC ACGCTTTATT 
TTAGGTGTTA AGCAATTAAT TGTTGCTATC AACAAGATGG 
GTGGGATGAA AAGAGATTTG AAGAAATTGT CAAGGAAACC 
TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT 
GGTTGGAATG GTGACAACAT GATTGAACCA TCTACTAACT 
CAAGGGTTGG ACTAAGGAAA CCAAGGCAGG TGTTGTTAAG 
TATTAGAAGC TATTGATGCT ATTGAACCAC CTGTCAGACC 
CCATTAAGAT TACCATTACA AGATGTTTAC AAGATTGGTG 
TGTGCCAGTC GGTAGAGTTG AAACC GGTGT TATTAAGCCA 
TTGTTTTCGC ACCATCTGGT GTTACCACTG AAGTCAAGTC 
CACCATGAAC AATTAGAACA AGGTGTCC CA GGTGACAATG 
CGTCAAGAAC GTCTCTGTTA AGGATATTAA GAGAGGTAAC 
ACTCCAAGAA CGACCCACCA CAAGGTTGTG CTTCCTTCAA 
ATTGTCTTGA ACCACCCTGG TCAAATTTCT GCAGGTTACT 
AGATTGT CAC ACTGCCCACA TTGCATGTAA GTTCGATGAA 
AGATTGACAG AAGAACTGGT AAGTCCGTTG AAGACCATCC 
AAGTCTGGTG ATGCAGCTAT CGTTAAGATG GTTC CAACCA 
TGTTGAAGCT TTCACTGAAT ACCCACCATT AGGTAGATTC 
A 



ATTGATATTG 50 

TATTGATGCT 100 

CTTCCCAAGC 150 

TTCGAAGCTG 200 

AGCATTCACC 250 

ACTCTGTTAA 3 00 

CAAAACTT CA 350 

TCCAATTTCT 4 00 

GTC CATGGTA 450 

GGTAAGACCT 500 

AACTGACAAG 550 

GTATTGGTAC 600 

GGTATGGTTG 650 

CGTTGAAATG 700 

TTGGTTTCAA 750 

GTTTGTGGTG 800 

TGCTCAAGTC 850 

CTCCAGTTTT 900 

TTAATCGAAA 950 

AAAGTCTGTT 1000 

AGCCAATGTG 1050 

GCAGTCAGAG 110 0 
1101 



55 



60 



2) INFORMATION FOR SEQ ID NO: 42 6 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1095 bases 
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20 



25 



45 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida parapsilosis 

(B) STRAIN: ATCC 90018 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426 



CAAATTGAAG GCTGAAAGAG AAAGAGGTAT CACCATTGAT ATCGCTTTGT 50 

GGAAATTCGA AACTCCAAAA T AC CATGTTA CTGTTATTGA TGCTCCAGGT 100 

15 S^GATT TCATCAAGAA TATGATTACT GGTACTTCTC AAGCTGATTG 150 

TGCTATTTTG ATTATTGCTG GTGGTACTGG TGAATTCGAA GCTGGTATCT 200 

SSggatgg TCAAACCAGA GAACACGCTT TGTTGGCTTA CACCTTGGGT 2 50 

StTAAGCAAT TGATTGTTGC CATCAACAAG ATGGACTCAG TCAAATGGGA 300 

CAAGAACAGA TACGAAGAAA TTGTCAAGGA AACTTCCAAC TTCGTCAAGA 350 

aSStTGGTTA CAACCCTAAA GCTGTCCCAT TCGTCCCAAT CTCTGGTTGG 4 00 

AACGGTGACA ATATGATTGA ACCATCAACC AACTGTCCAT GGTACAAGGG 450 

TTGGGAAAAG GAAACTAAAG CTGGTAAGGT TACCGGTAAG ACCTTGTTGG 50 0 

AAGCTATCGA TGCTATCGAA CCACCAACCA GACCAACTGA CAAGCCATTG 550 

AGATTGCCAT TGCAAGATGT CTACAAGATT GGTGGTATTG GAACTGTGCC 60 0 

AGTTGGTAGA GTTGAAACCG GTATCATCAA GGCTGGTATG GTTGTTACTT 650 

TTGCCCCAGC TGGTGTTACC ACTGAAGTCA AGTCCGTTGA AATGCAC CAC 700 

GAACAATTGA CTGAAGGTGT CCCAGGTGAC AATGTTGGTT TCAACGTCAA 7 50 

GAACGTTT CA GTTAAGGAAA TCAGAAGAGG TAACGTTTGT GGTGACTCCA 800 

AGAACGATCC ACCAAAGGGA TGTGAYTCCT TCAATGCTCA AGTTATTGTC 8 50 

3 0 TTGAAC CAC C CAGGTCAAAT CTCTGCTGGT TACT CACCAG TCTTGGATTG 90 0 

TCACACTGCC CACATTGCTT GTAAATTCGA CACTTTGATT GAAAAGATTG 950 

ACAGAAGAAC CGGTAAGAAA TTGGAAGATG AACCAAAATT CATCAAGTCC 100 0 

GGTGATGCTG CYATCGTCAA GATGGTC CCA ACCAAGCCAA TGTGTGTTGA 1050 

AGCTTTCACT GACTACCCAC CATTGGGAAG ATTCGCTGTT AGAGA 10 95 



35 



2) INFORMATION FOR SEQ ID NO: 427 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida rugrosa 
50 (B) STRAIN: ATCC 96275 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427 

CTCCGTCAAG TGGTCTCAGT CTCGTTTCGA GGAGATCGTC AAGGAGGTTT 50 

55 C CAACTT CAT CAAGAAGGTT GGTTACAAGC CCGATGAGGT TCCTTTCGTC 100 

CCCATCTCTG GCTGGAACGG CGACAACATG CTTGAGCCCT CCACCAACTG 150 

CCCCTGGTAC AAGGGATGGA CCAAGAAGAC CAAGAAGGGT GAGGTCAAGG 2 00 

GTAAGACTCT TCTCGAGGCC ATTGAC GC C A TCGAGCCCCC CTCCCGTCCT 2 50 

ACCGACAAGC CCCTCCGCTT GCCTCTTCAG GATGTCTACA AGATCGGCGG 3 00 

60 TATCGGTACG GTACCTGTCG GCCGTGTCGA GACCGGTATC ATCAAGCCCG 3 50 
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GCATGGTCGT CACTTTCGCC CCCGCTGGTG 
GTCGAGATGC ACCACGAGCA GATCCCCGAG 
CGGTTTCAAC GTCAAGAACG TTACCGTCAA 
TCTGCGGTGA CTCCAAGAAC GACCCCCCCA 
5 GCCCAGGTCA TCGTTTTCAA CCACCCCGGT 
CCCCGTTTTG GACTGCCACA CCGCCCACAT 
TCCAGTCCAA GATGGACCGT CGTACTGGTA 
AAGTTCATCA AGGCTGGTGA CTCCGCTATC 
GC 



TCACCACTGA AGTGAAGTCC 4 00 

GGTCTCCCCG GTGACAACGT 4 50 

GGATATCCGC CGTGGTAACG 500 

AGGGCTGCTC TTCCTTCACT 550 

CAGATCTCCA ACGGTTACTC 6 00 

TGCCTGCCGC TTCGACGAGA 65 0 

AGACCCTTGA GGAGAACCCC 700 

GTCAAGATGG TTCCCTCCAA 750 

752 



2) INFORMATION FOR SEQ ID NO: 428 

15 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1093 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida sphaerica 
25 (B) STRAIN: ATCC 2504 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428 

AGTTAAAGGC TGAAAGAGAA AGAGGTATCA CCATCGATAT CGCTTTGTGG 50 

3 0 AAGTTCGAAA CTC CAAAGTA CCAAGTTACC GTTATCGATG CTCCAGGTCA -100 

CAGAGATTTC ATCAAGAACA TGATTACTGG TACTTCTCAA GCTGACTGTG 150 

CTATCTTGAT TATTGCTGGT GGTGTCGGTG AATTCGAAGC CGGTATCTCC 2 00 

AAGGATGGTC AAACCAGAGA ACACGCTTTG TTGGCTTTCA CCTTGGGTGT 250 

TAGACAATTG ATTGTTG CTG T T AACAAG AT GGATTCCGTT AAGTGGGATG 3 00 

35 AATCTCGTTT CCAAGAAATT GTCAAGGAAA CCTCTAACTT CATCAAGAAG 3 50 

GTCGGTTACA ACC CAAAGAC TGTTCCATTC GTCCCAATCT CTGGTTGGAA 4 00 

CGGTGACAAC ATGATTGAAG CCACCACCAA TGCTTCATGG TACAAGGGTT 4 50 

GGGAAAAGGA AACCAAGTCC GGTGTCGTCA AGGGTAAGAC CTTGTTGGAA 5 00 

GCTATTGACG CTATCGAACC ACCATCCAGA CCAACTGACA AGCCATTGAG 550 

40 ATTGCCATTG CAAGATGTCT ACAAGATTGG TGGTATCGGA ACTGTGCCAG 60 0 

TCGGTAGAGT CGAAACCGGT GTTATCAAGC CAGGTATGAT TGTTACCTTT 650 

GCCCCAGCCG GTGTTACTAC TGAAGTTAAG TCCGTCGAAA TGCACCACGA 7 00 

ACAATTGGAA GAAGGTCTAC CAGGTGACAA CGTCGGTTTC AACGT CAAGA 75 0 

ACGTTTCCGT TAAGGAAATC AGAAGAGGTA ACGTCTGTGG TGACTCCAAG 8 00 

45 AACGATCCAC CAAAGGCTGC TGCTTCTTTC AACGCCACTG TTATCGTCTT 850 

' GAACCATC CA GGTCAAATCT CTGCTGGTTA CTCTCCAGTT TTGGATTGTC 900 

ACACTGCTCA CATTGCTTGT AAGTT CGACG AATTGTTGGA AAAGAACGAT 95 0 

AGAAGATCCG GTAAGAAGTT GGAAGACT CT CCAAAGTTCT TGAAGTCCGG 1000 

TGATGCTGCT TTGGTTAAGT TCGTTCCATC TAAGCCAATG TGTGTTGAAG 10 50 

50 CCTTCTCTGA CTACCCACCT CTAGGTAGAT TCGCTGTCAG AGA 1093 



2) INFORMATION FOR SEQ ID NO: 429 

55 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1094 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
6 0 (D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Candida tropicalis 

(B) STRAIN: ATCC 13803 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429 

10 AAATTGAAGG CTGAAAGAGA AAGAGGTATC ACCATTGATA TCGCTTTGTG 50 

GAAATTCGAA ACTC CAAAAT ACCACGTTAC CGTTATTGAT GCTCCAGGTC 100 

ACAGAGATTT CATCAAGAAC ATGATTACTG GTACTTCCCA AGCTGATTGT 150 

GCTATTTTGA TTATTGCTGG TGGTACTGGT GAATTCGAAG CTGGTATTTC 200 

TAAAGATGGT CAAACCAGAG AACACGCTTT GTTGGCTTAC ACCTTGGGTG 250 

15 TCAAACAATT GATTGTTGCT GTCAACAAGA TGGACTCTGT TAAATGGGAC 3 00 

AAAAACAGAT TTGAAGAAAT TATCAAGGAA ACTTCTAACT TCGT C AAGAA 3 50 

GGTTGGTTAC AACCCTAAGG CTGTTC CATT CGTTCCAATC TCTGGTTGGA 400 

ATGGTGACAA CATGATTGAA GCTTCTACCA ACTGTCCATG GTACAAGGGT 450 

TGGGAAAAAG AAACCAAGGC TGGTAAGGTT ACCGGTAAGA CTTTGTTGGA 500 

2 0 AGCCATTGAT GCTATTGAAC CACCTTCAAG ACCAACTGAC AAGC C ATTGA 550 

GATTGCCATT GCAAGATGTT TACAAGATTG GTGGTATTGG TACTGTGCCA 600 

GTCGGTAGAG TTGAAACTGG TGTCATCAAA GCCGGTATGG TTGTTACTTT 65 0 

CGCCCCAGCT GGTGTTACCA CTGAAGTCAA ATCCGTCGAA ATGCACCACG 700 

AACAATTGGC TGAAGGTGT C CCAGGTGACA ATGTTGGTTT CAACGTTAAG 750 

25 AACGTTTCTG TTAAAGAAAT TAGAAGAGGT AACGTTTGTG GTGACTCCAA 800 

GAACGAT C C A CCAAAGGGTT GTGACTCTTT CAACGCTCAA GTTATTGTCT 85 0 

TGAACCACCC AGGTCAAATC TCTGCTGGTT ACTCTCCAGT CTTGGATTGT 90 0 

CACACTGCTC ATATTGCTTG TAAATTCGAC ACCTTGGTTG AAAAGATTGA 950 

CAGAAGAACT GGTAAGAAAT TGGAAGAAAA TCCAAAATTC GTCAAATCCG 10 0 0 

30 GTGATGCTGC TATTGTCAAG ATGGTT CCAA CCAAACCAAT GTGTGTTGAA 1050 

GCTTTCACTG ACTACCCACC ATTAGGTAGA TTCGCTGTCA GAGA 10 94 



35 2) INFORMATION FOR SEQ ID NO: 430 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 10 95 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida utilis 

(B) STRAIN: Csp 388 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430 



CAAGCTTAAA GCTGAGAGAG AGAGAGGTAT CACTATCGAC ATTGCTCTCT 5 0 

GGAAGTTCGA GACT CCAAAG TACCACGT T A CTGTCATTGA TGCCCCAGGT 100 

CACAGAGATT TCATCAAGAA CATGATTACT GGTACCTCCC AGGCTGACTG 15 0 

TGCTATTCTT ATCATTGCCG GTGGTGTTGG TGAGTTCGAG GCTGGTATCT 200 

55 CTAAGGATGG TCAGACCAGA GAGCACGCTT TGCTCGCTTT CACCCTTGGT 2 50 

GTTAGACAGA TGATTGTTGC TATCAACAAG ATGGACTCTG TCAAGTGGGA 3 00 

CGAGAAGAGA TTCGAGGAGA TCGTTAAGGA GACCTCTAAC TTCATCAAGA 3 50 

AGGTTGGTTA CAACCCAAAG ACTGTTCCAT TTGTC C CAAT TTCYGGTTGG 4 00 

AACGGTGACA ACATGATTGA GGCCTCTACC AACTGTCCAT GGTACAAGGG 4 50 

6 0 TTGGGAGAAG GAGACCAAGG CTGGTGTTGT CAAGGGTAAG ACCTTGCTCG 500 
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ATGCCATTGA CGCCATTGAG CCACCAACAA GACCAACTGA CAAGCCATTG 55 0 

AGATTGCCAC TCCAGGATGT CTACAAGATT GGTGGTATCG GAACTGTTCC 60 0 

AGTCGGTAGA GTCGAGACCG GTGTCATCAA GCCAGGTATG GTTGTTACCT 650 

TTGCCCCATC CGGTGTCACC ACTGAGGTTA AGTCCGTCGA GATGCACCAC 70 0 

5 GAGCAGCTTG CTGAGGGTAT CCCAGGTGAC AACGTTGGTT TCAACGTTAA 750 

GAACGTCTCT GTTAAGGAGA TCAGAAGAGG TAACGTTGCC GGTGACTCCA 8 00 

AGAACGACCC ACCACAGGGT GCTGAGTCCT TCAACGCTCA GGTCATTGTC 850 

TTGAACCACC CAGGTCAGAT CTCTGCTGGT TACTCTCCAG TTTTGGACTG 900 

TCACACCGCC CACATTGCTT GTAAGTTCTC TGAGCTTTTG GAGAAGATTG 950 

10 ACAGAAGATC CGGTAAGTCC CTTGAGGCCT CTCCAAAGTT CGTCAAGTCT 10 00 

GGTGATGCCG CTATCGTCAA GATGGTTCCA TCCAAGCCAT TGTGTGTTGA 1050 

GGCCTTCACT GACTACCCAC CACTCGGTAG ATTCGCTGTC AGAGA 1095 



15 



25 



55 



2) INFORMATION FOR SEQ ID NO: 431 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1085 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida viswanathil 

(B) STRAIN: ATCC 28269 

30 (xi) SEQUENCE DESCRIPTION : ' SEQ- ID NO: 431 

GCTGAAAGAG AAAGAGGTAT CACCATCGAT ATCGCTTTGT GGAAATTCGA 50 

AACTCCAAAR TAC CACGTT A CCGTCATTGA YGCTCCAGGT CACAGAGATT 100 

TCATCAAGAA CATGATYACT GGTACTTCTC AAGCTGATTG TGCTATYTTG 150 

3 5 ATTATCGCTG GTGGTACTGG TGAATTCGAA GC TGGTATYT CTAAGGATGG 2 00 

TCAAACCAGA GAACACGCTT TGTTGGCCTA CACCTTGGGT GTCAAGCAAT 2 50 

TGATTGTTGC TGT CAACAAG ATGGACTCTG TCAAATGGGA CAAGAACAGA 3 00 

TTCGAAGAAA TCATCAAGGA AACCTCCAAC TTCGTCAAGA AGGTTGGTTA 3 50 

CAACCCAAAG ACTGTTCCAT TCGTCCCAAT CTCTGGTTGG AACGGTGACA 4 00 

4 0 ACATGATTGA AGCCTCCACC AACTGCCCAT GGTACAAGGG TTGGGAAAAG 4 50 

GAAACCAAGG CTGGTAAGGT TACCGGTAAG ACTTTGTTGG AAGCCATTGA 500 

CGC TATCGAA CCACCAACCA GACCAACTGA CAAGCCATTG AGATTGCCAT 550 

TGCAAGATGT CTACAAGATT GGTGGTATCG GAACTGTGCC AGTCGGTAGA 60 0 

GTTGAAACTG GTGTCATCAA GGCCGGTATG GTTGTCACTT TYGCCCCAGC 650 

45 TGGTGTTACC ACTGAAGTCA AGTCCGTTGA AATGCACCAC GAACAATTGG 7 00 

CTGAAGGTGT CCCAGGTGAC AACGTTGGTT TCAACGTCAA GAACGTTTCC 750 

GTCAAGGAAA TCAGAAGAGG TAACGTCTGT GGTGACTCCA AGAACGACCC 800 

ACCAAAGGGT TGTGASTCTT TCAACGCTCA AGTCATTGTC TTGAACCACC 8 50 

CAGGTCAAAT CTCTGCTGGT TACTCTCCAG TCTTGGATTG TCACACTGCC 90 0 

50 CACATTGCTT GTAAGTTTGA CACCTTGGTT GAAAAGATTG ACAGAAGAAC 950 

CGGTAAGAAG TTGGAAGAAA AC CCAAAGTT TGTCAAGTCC GGTGACGCTG 1000 

CTATCGTCAA GATGGTCCCA AC CAAGC CAA TGTGTGTTGA AGCYTTCACT 1050 

GACTACCCAC CATTGGGTAG ATTCGCTGTC AGAGA 1085 



2) INFORMATION FOR SEQ ID NO: 432 



( i ) SEQUENCE CHARACTERISTICS : 
60 (A) LENGTH: 1072 bases 
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10 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida zeylanoides 

(B) STRAIN: ATCC 73 51 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432 



AGGTATTACC ATTGACATTG CCTTGTGGAA GTTCGAGACC CCCAAGTACC 5 0 

AGGTCACCGT CATTGACGCT CCTGGCCACA GAGATTTCAT TAAGAACATG 100 

15 AT C ACTGGT A CCTCCCAGGC TGACTGTGCC ATCTTGATCA TTGCTGGTGG 150 

TGTTGGTGAG TTCGAGGCTG GTATCTCCAA GGATGGCCAG ACCAGAGAGC 2 00 

ACGCCTTGCT TGCCTACACC TTGGGTGTCA AGCAATTGAT TGTTGCTGTC 2 50 

AACAAGATGG ACTCCGTCAA GTGGGACAAG AACAGATTCG AGGAGATTGT 3 00 

CAAGGAGACC TCCAACTTCG TCAAGAAGGT TGGCTACAAC CCCAAGACTG 3 50 

2 0 TCCCCTTCGT TCCCATCTCC GGTTGGAACG GTGACAACAT GATTGAGGCC 4 00 

TCCACCAACT GCCCTTGGTA CAAGGGTTGG GAGAAGGAGA CCAAGGCCGG 4 50 

TAAGGTCACT GGTAAGACCT TGTTGGAGGC TATTGACGCC ATTGAGCCCC 50 0 

CCACCAGACC CACCGACAAG CCCTTGAGAT TGCCCTTGCA GGATGTCTAC 55 0 

AAGATTGGTG GTATTGGAAC GGTGCCCGTT GGCAGAGTTG AGAC CGGCAT 600 

25 CATCAAGGCC GGTATGGTTG TCACCTTTGC CCCCGCTGGT GTCACTACTG 650 

AAGTGAAGTC TGTCGAGATG CACCACGAGC AATTGGCTGA GGGTGTCCCA 700 

GGTGACAATG TTGGTTTCAA CGTGAAGAAC GTTTCCGTTA AGGAGAT CAG 7 50 

AAGAGGTAAC GTTTGCGGTG ACTC CAAGAA CGACCCCCCC AAGGCTGCTG 8 00 

CTTCTTTCAA CGCCCAGGTT ATCGTCTTAA ACCACCCCGG TCAAATCTCT 8 50 

3 0 GCTGGTTACT CTCCGGTTTT GGATTGCCAC ACTGCCCACA TTGCTTGCAG 900 

ATTCGACCAG TTGATTGAGA AGAT CGACAG AAGAACCGGT AAGAAGATGG 9 50 

AGGACGACCC TAAGTTC AT C AAGTCCGGTG ACGCTGCCAT CGTCAAGATG 10 00 

GTTCCTTCCA AGCCCATGTG TGTTGAGGCC TTCACTGACT ACCCTCCCTT 1050 

GGGTCGTTTC GCTGTCAGAG AC 10 72 

35 



2 ) INFORMATION FOR SEQ ID NO: 433 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coccidioides immitis 
50 (B) STRAIN: Silveira 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433 

AGCACCAACT GGTCCGAGCC TCGTTTCAAC GAAATCGTCA AGGAAGTCTC 50 

55 CAACTTCATC AAGAAGGTCG GATACAACCC CAAGGCTGTT CCATTCGTCC 100 

CCATCTCTGG TTTCGAAGGT GACAACATGA TTCAACCCTC CACCAACGCT 15 0 

CCTTGGTACA AGGGCTGGAA CAAGGAGACC GCCTCTGGCA AGCACACTGG 200 

CAAGACCCTC CTCGACGCCA TTGATGCCAT CGACCCCCCA ACCCGCCCCA 250 

CCGAGAAGCC CCTCCGTCTC CCACTTCAGG ATGTGTACAA GATCTCTGGT 300 

60 ATCGGAACAG TCCCAGTCGG CCGTGTCGAA ACCGGTGTTA TCAAGCCTGG 3 50 
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TATGGTTGTG ACCTTCGCTC 
TCGAAATGCA CCACCAGCAG 
GGCTTCAACG TCAAGAACGT 
CGCTGGTGAC TCCAAGAACG 
5 CCCAGGTCAT CGTCCTCAAC 
CCAGTCCTTG ACTGCCACAC 
CCTCGAGAAG ATCGACCGCC 
AGTTCATCAA GTCTGGTGAT 
C 



CTTCCAACGT CACCACTGAA 
CTCACCCAGG GTAACCCTGG 
CTCTGTCAAG GAAGTCCGCC 
ACCCACCAAA GGG CTGCGAC 
CACCCTGGTC AAGTCGGTGC 
TGCCCACATT GCTTGCAAGT 
GTACCGGTAA ATCCGTTGAG 
GCCGCTATCG TCAAGATGGT 



GTCAAGTCCG 4 00 

TGACAACGTT 450 

GCGGTAACGT 500 

TCCTTCAACG 5 50 

TGGTTATGCC 600 

TCTCCGAGCT 65 0 

AACAACCCCA 7 00 

TCCATCCAAG 750 
751 



2) INFORMATION FOR SEQ ID NO: 434 

15 ( i ) S EQUENCE CHARACTERI STICS : 

(A) LENGTH: 1146 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus albldus 
25 (B) STRAIN: ATCC 66030 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434 

AAGCTCAAGG CCGAGCGAGA GCGAGGTATC ACCATCGACA TCGCCTTGTG 5 0 

30 GAAGTTCGAG ACCCCCAAGT ACAATGTCAC CGTCATTGAC GCCCCCGGTC 100 

ACCGAGACTT CATCAAGAAC ATGATCACCG GTACCTCGCA GGCCGACTGT 150 

GCCATCCTCA TCATCGCCTC CGGTATCGGA GAGTTCGAGG CTGGTATCTC 2 00 

CAAGGACGGT CAGACCCGAG AGCACGCCCT TTTGGCCTTC ACCCTCGGTG 2 50 

TCCGACAGCT CATCATTGCC ATCAACAAGA TGGACACCTG CAAGGTTAGT 3 00 

3 5 TCGCAGGTCC TGGTCTCTGT ACGAATCTTG CTGACCCCTT TTACAGTGGT 3 50 

CCGAAGACCG ATACAACGAA ATCGTCAAGG AGGCTTCCGG TTTCATCAAG 4 00 

AAGGTCGGAT ACAACCCCAA GACCGTTCCC TTCGTCCCCA TCTC CGGATG 4 50 

GCACGGAGAC AACATGTTGG AGGAGTCCAC CAACATGCCC TGGTACAAGG 500 

GATGGCACAA GGAGTCCAAG GCCGGTGTTG TCAAGGGAAA GACCTTGCTC 55 0 

40 GAGGC CATCG ACGCCATCGA GCCCCCTACC CGACCTTCCG ACAAGCCCTT 600 

GCGATTGCCC CTCCAGGATG TCTACAAGAT CGGTGGTATC GGTACGGTGC 65 0 

CCGTCGGTCG AGTCGAGACC GGTGTCATCA AGGCTGGTAT GGTCGTCACC 70 0 

TTCGCCCCTG CCAACGTCAC CACCGAAGTC AAGTCCGTCG AAATGCACCA 75 0 

CGAACAGCTC GCTGAGGGTG TTCCCGGTGA CAACGTCGGT TTCAACGTCA 8 00 

45 AGAACGTTTC CGTCAAGGAC ATCCGACGAG GAAACGTCTG CTCCGACTCG 850 

AAGAACGACC CCGCTATGGA GTCTGCTTCC TTCAACGCTC AGGTCATTGT 900 

CTTGAACCAC CCGGGTCAGA TCGGTGCCGG CTACTCCCCC GTTTTGGACT 950 

GCCACACCGC TCACATTGCC TGCAAGTTCG CTGAGCTCGT TGAGAAGATC 10 00 

GAC CGACGAA CCGGTAAGGT CATGGAGGCC GCCCCCAAGT TCGTCAAGTC 1050 

50 CGGTGACGCC GCCATCGTCA AGTTGATCCC GTCCAAGCCC ATGTGTGTCG 1100 

AGTCCTACTC CGAGTACCCC CCCTTGGGTC GATTCGCCGT CCGAGA 114 6 



55 2) INFORMATION FOR SEQ ID NO: 435 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 95 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Exophiala jeanselmei 

(B) STRAIN: ATCC 64755 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435 



CAAGCTGAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCTTGT 50 

GGAAGTTCGA GACTCCCAAG TACTATGTCA CTGTCATCGA CGCCCCTGGT 100 

CATCGTGACT TTATCAAGAA CATGATCACT GGTACTTCCC AGGCTGACTG 15 0 

CGCCATTCTC ATCATTGCCG CCGGTACTGG TGAATT CGAA GCCGGTATCT 2 00 

15 CCAAGGATGG TCAGACTCGT GAGCACGCTC TGCTCGCCTA CACCCTGGGT 250 

GTCAAGCAGC TCATTGTCGC CATCAACAAG ATGGACACCA CCAAGTGGTC 3 00 

CGAGGATCGT TTCAACGAAA TCATCAAGGA GACTTCCAGC TTCATCAAGA 350 

AGGTCGGCTA CAACCCCAAG TCCGTTCCTT TCGTCCCCAT CTCCGGCTTC 4 00 

AACGGTGACA ACATGATCGA TGTCTCCACC AACTGCCCCT GGTACAAGGG 450 

2 0 CTGGGAGAAG GAGACCAAGG CTGGCAAGGC CTCTGGCAAG ACTCTCCTCG 500 

AGGCCATCGA CGCCATCGAC CCCCCCACTC GTCCCACCGA CAAGCCTCTC 550 

CGTCTTCCTC TCCAGGATGT CTACAAGATC TCTGGTATCG GAACGGTGCC 60 0 

CGTCGGTCGT GTTGAGACTG GTGTCATCAA GGCCGGTATG GTCGTTACCT 650 

TCGCTCCTGC CAACGTCACC ACTGAAGTCA AGTCCGTCGA AATGCACCAC 700 

2 5 GAACAACTCG CCGAGGGTGT TCCAGGTGAC AACGTTGGTT TCAACGTCAA 75 0 

GAACGTCTCC GTCAAGGAGG TTCGTCGTGG AAACGTCTGC GGTGACTCCA 80 0 

AGAACGACCC ACCCAAGGGT GCTGATTCCT TCAACGCCCA GGTCATCGTC 850 

TTGAACCACC CTGGT CAAGT CGGTGCTGGC TACGCCCCAG TGTTGGATTG 90 0 

CCACACTGCC CACATTGCTT GCAAGTTCTC TGAGCTTCTC GAGAAGATTG 950 

3 0 ACCGCCGTAC CGGTAAATCC ATC GAAAAC A ACCCCAAGTT CATCAAGTCT 10 00 

GGTGACGCTG CCATCGTCAA GATGGTTCCC AGCAAGCCCA TGTGTGTTGA 1050 

GGCCTTCACT GACTACCCAC CTCTTGGTCG TTTCGCCGTC CGTGA 1095 



2) INFORMATION FOR SEQ ID NO: 436 



( i ) SEQUENCE CHARACTERI STI CS : 
(A) LENGTH: 1113 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Fusarium oxysporum 

(B) STRAIN: WSA-212 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436 

AAG CTCAAGG CCGAG CGTGA GCGTGGTATC ACCATCGATA TTGCTCTCTG 50 

GAAGTTCGAG ACTCCTCGCT ACTATGTCAC CGTCATTGGT ATGTTGTCGC 10 0 

TCATGCTTCA TTCTACTTCT CTTCGTACTA ACACATCACT CAGACGCTCC 15 0 

55 CGGTCACCGT GATTT CAT C A AGAACATGAT CAC TGGTACT TCCCAGGCCG 2 00 

ATTGCGCCAT TCTCATCATT GCCGCCGGTA CTGGTGAGTT CGAGGCTGGT 2 50 

ATCTCCAAGG ATGGC CAGAC CCGTGAGCAC GCTCTTCTTG CCTACACCCT 3 00 

TGGTGTCAAG AACCTCATCG TCGCCATCAA CAAGATGGAC AC CAC CAAGT 350 

GGTCTGAGGC CCGTTACCAG GAGATCAT CA AGGAGACCTC CTCTTTCATC 4 00 

60 AAGAAGGTCG GCTACAACCC CAAGGCTGTC GCTTTCGTCC CCATCTCCGG 4 50 
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TTTCAACGGT GACAACATGC TTACCCCCTC CACCAACTGC CCCTGGTACA 500 

AGGGTTGGGA GCGTGAGATC AAGTCCGGCA AGCTCACTGG CAAGACCCTC 550 

CTCGAGGCCA TTGACTCCAT CGAGCCCCCC AAGCGTCCCG TTGACAAGCC 600 

CCTTCGTCTT CCCCTTCAGG ATGTC TACAA GATCGGTGGT ATTGGAACGG 650 

5 TTCCCGTCGG CCGTATCGAG ACTGGTGTCA TCAAGCCCGG TATGGTCGTT 700 

ACCTTCGCTC CTTCCAACGT CACCACTGAA GTCAAGTCCG TCGAGATGCA 750 

CCACGAGCAA CTCACTGAGG GCCAGCCCGG TGACAACGTT GGTTTCAACG 80 0 

TGAAGAACGT CTCCGTCAAG GACATCCGAC GTGGTAACGT CGCTGGTGAC 850 

TCCAAGAACG ACCCCCCTAT GGGTGCCGCT TCTTTCACCG CCCAGGTCAT 900 

10 CGTCCTCAAC CACCCCGGTC AGGTCGGTGC TGGTTACGCT CCCGTCCTCG 950 

ATTGTCACAC TGCCCACATT GCCTGCAAGT TCGCCGAGAT CCAGGAGAAG 1000 

ATCGACCGCC GAACCGGTAA GGCTACTGAG GCCGCCCCCA AGTTCATCAA 1050 

GTCTGGTGAC TCCGCCATCG T C AAGATGGT TCCCTCCAAG CCCATGTGTG 110 0 

TTGAGGCTTT CAC 1113 

15 

2) INFORMATION FOR SEQ ID NO: 437 

2 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



50 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Geotrichum spp . 

3 0 (B) STRAIN: LEV- 4 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437 

GGTCCGAGGA CAGATTCAAC GAGATTGTCA AGGAGACTTC CAACTTCATC 50 

35 AAGAAGGTTG GTTAYAACCC CAAGACTGTT GCTTTCGTCC CCATCTCTGG 100 

TTGGAACGGT GACAACATGA TTGAGCCCTC CACCAACTGC CCCTGGTACA 150 

AGGGATGGCA GAAGGAG AC C AAGGCTGGTG TCACTAAGGG TAAGACCCTC 2 00 

CTTGAGGCCA TCGATGCCAT TGAGCCCCCT GTCAGACCTT CCGACAAGCC 250 

CCTCCGTCTT CCCCTCCAGG ATGTCTACAA GATCGGTGGT ATCGGAACTG 300 

4 0 TGCCCGTCGG CCGTGTCGAA ACCGGTGTCA TCAAGGC CGG TATGGTCGTC 350 

ACCTTCGCCC CCGCTGGTGT CACCACTGAA GTCAAGTCCG TCGAGATGCA 4 00 

CCACGAGCTC CTCACTGAGG GTCTCCCCGG TGACAACGTT GGTTTCAACG 450 

TCAAGAACGT CTCCGTTAAG GATATCAGAC GTGGTAACGT CTGCGGTGAC 50 0 

TCCAAGAACG ATCCCCCCAA GGCTTGCGCT TCTTTCAACG CCCAGGTCAT 550 

45 TATCTTCAAC CACCCTGGTC AGATCTCTGC TGGATACTCT CCCGTCCTTG 600 

ATTGCCACAC CGC CCATATT GCTTGCAAGT TCGACACTTT GATCGAGAAG 650 

ATTGAC CGTC GTACTGGTAA GAAGACTGAG GACTCCCCCA AGTTCGTCAA 700 

GGCCGGTGAT GCTGCTATCG TCAAGA 72 6 



2) INFORMATION FOR SEQ ID NO: 438 



( i ) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 754 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

6 0 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulation 

(B) STRAIN: G186A5 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438 

CACCACCAAG TGGTCCGAGT CCCGTTTCAA CGAAATC AT C AAGGAGGTTT 50 

CCAACTTCAT CAAGAAGGTC GGATATAACC CCAAGGCTGT TCCCTTCGTG 10 0 

10 CCAATCTCTG GTTT CGAGGG TGACAACATG ATTGAACCCT CCCCCAACTG 15 0 

CACATGGTAC AAGGGCTGGA ACAAGGAGAC TGCCTCTGGC AAGTCTTCTG 20 0 

GTAAAAC C C T TCTCGATGCC ATTGACGCCA TTGAACCCCC AACCCGTCCT 250 

ACCGATAAGC CCCTCCGTCT TCCCCTCCAG GATGTGTACA AAATCTCTGG 3 00 

TATTGGCACT GTTCCCGTCG GACGTGTTGA GACTGGTGTC ATCAAGCCCG 350 

15 GTATGGTCGT GACTTTCGCT CCCTCCAACG TCACCACTGA AGTCAAGTCC 400 

GTCGAAATGC ACCACCAACA ACTCCAGGCT GGTTACCCTG GCGACAACGT 450 

CGGCTTCAAC GTCAAGAACG TTTCAGTCAA GGAAGTCCGC CGTGGCAACG 500 

TTGCTGGCGA CTCCAAAAAT GATCCCCCTA AGGGCTGCGA ATCCTTCAAT 550 

GCCCAGGTCA TCGTCCTTAA CCACCCCGGC CAGGTTGGCG CTGGTTATGC 600 

20 CCCAGTCCTC GACTG C CAC A CTGCCCACAT TGCTTGCAAG TTCTCTGAGC 650 

TTATTGAGAA GATCGACCGC CGTACCGGAA AGTCTGTTGA GAACAACCCC 700 

AAGTTCATCA AGTCTGGTGA TGCTGCTATC GTCAAGATGG TTCCCTCCAA 750 

GCCC 754 

25 

2) INFORMATION FOR SEQ ID NO: 43 9 

( i ) SEQUENCE CHARACTERI STI CS : 
3 0 (A) LENGTH: 743 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Iasatchenkia orientalie kudrjanzev 

(B) STRAIN: ATCC 6258 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439 

TGGGATGAAA ACAGATTTGA AGAAATTGTC AAGGAAACCC AAAACTTCAT 50 
CAAGAAGGTT GGTTACAACC CAAAGAC TGT TCCATTCGTT CCAATCTCTG 100 

4 5 GTTGGAATGG TGACAACATG ATTGAAGCAT CCACCAACTG TCCATGGTAC 150 

AAGGGT TGGA CTAAGGAAAC CAAGGCAGGT GTTGTTAAGG GTAAGACCTT 2 00 

ATTAGAAGCA ATCGATGCTA TTGAACCACC TGTCAGACCA AC CGAAAAGC 2 50 

CATTAAGATT ACCATTACAA GATGTTTACA AGATTGGTGG TATTGGTACT 3 00 

GTGCCAGTCG GTAGAGTCGA AACCGGTGTC ATTAAGCCAG GTATGGTTGT 350 

50 CACTTTTGCT CCAGCAGGTG TCACCACCGA AGTCAARTCC GTTGAAATGC 4 00 

ACCATGAACA ATTAGAACAA GGTGTTC C AG GTGATAACGT TGGTTTCAAC 450 

GTTAAGAACG TCTCTGTCAA GGATATCAAG AGAGGTAACG TTTGTGGTGA 50 0 

CTC CAAGAAC GACCCACCAA TGGGTGCAGC TTCYTTCAAT GCTCAAGTCA 550 

TTGTCTTGAA CCACCCTGGT CAAATTTCCG CTGGTTACTC TCCAGTCTTG 600 

5 5 GATTGTCACA CTGCCCACAT TGCATGTAAG TTCGACGAAT TAATC GAAAA 650 

GATTGACAGA AGAACTGGTA AGTCTGTTGA AGACCATCCA AAGTCYGTCA 700 
AGTCTGGTGA TGCAGCTATC GTCAAGATGG TCCCAACCAA GCC 743 



60 
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2) INFORMATION FOR SEQ ID NO: 440 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1091 bases 
5 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malassezia furfur 

(B) STRAIN: ATCC 42132 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440 

CAAGCTCAAG GCTGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCTCTGT 5 0 

GGAAGTT CGA GACCCCTAAG TAC CACGTTA CCGTCATTGA CGCTCCTGGT 100 

CACCGTGACT TCATCAAGAA CATGATTACG GGTACCTCGC AGGCTGACTG 150 

20 CGCTATCCTC ATCATTGCCG GTGGTACCGG TGAGTTCGAG GCTGGTATCT 2 00 

CGAAGGACGG TCAGACCCGT GAGCACGCTC TGCTCGCTTT CACCCTGGGT 250 

GTGCGTCAGC TCATTGTGGC CGTCAACAAG ATGGACACCA CCAAGTACTC 3 00 

GGAGGACCGC TTCAACGAGA TTGTCCGCGA AGTGTCGAAC TTCATCAAGA 3 50 

AGGTCGGTTT CAACCCCAAG ACTGTTGCCT TCGTCCCCAT CTCGGGCTGG 4 00 

25 CACGGTGACA ACATGATCGA GGCCACCACC AACATGC CTT GGTACAAGGG 4 50 

CTGGGAGAAG GAGAC CAAGT CGGGCAAGGT CACTGGTAAG ACTCTGCTGG 5 00 

ACGCCATCGA CGC CAT C GAG CCCCCGACCC GCCCCACTGA CAAGCCCCTG 550 

CGTCTCCCTC TGCAGGATGT GTACAAGATC GGTGGTATCG GTACTGTCCC 600 

TGTCGGTCGT GTTGAGACCG GTGTGATCAA GCCCGGTATG GTTGTGACCT 6 50 

30 TCGCTCCCTC GAACGTCACC ACTGAAGTTA AGTCGGTTGA GATGCACCAC ■ 7 00 

GAGTCGCTCC CTGAGGGTCT CCCCGGTGAC AACGTTGGTT TCAACGTGAA 7 50 

GAACGTCTCG GTTAAGGACA TTCGCCGTGG TAACGTTGCC TCGGACTCGA 8 00 

AGAACGACCC CGCTCAGGAG GCTGCTTCGT TCAACGCGCA GGTCATTGTC 8 50 

ATGAACCACC CTGGTCAGAT CAGCAACGGT TACTCGCCCG TGCTTGACTG 9 00 

35 CCACACTGCG CACATTGCCT GCCGCTTCAA CAACATCCTC CAGAAGATCG 9 50 

ACCGTCGCTC GGGTAAGGTG CTTGAGGAGA ACCCCAAGTT CATCAAGTCG 1000 

GGTGACGCTG CCATGGTGGA GATGATCCCC ACCAAGCCCA TGTGTGTGGA 105 0 

GTCGTTCAAC GAGTACCCCC CTCTGGGTCG TTTCGCTGTG C 10 91 

40 

2) INFORMATION FOR SEQ ID NO: 441 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 74 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malassezia pachydermatis 

(B) STRAIN: ATCC 42756 

55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441 

ACCACCAAGT AC TCGGAGGA CCGCTTCAAC GAGATTATTC GCGAGACTTC 50 
GAACTTCATC AAGAAGGTCG GTTACAACCC GAAGACTGTT GCCTTCGTCC 10 0 

6 0 CGATCTCGGG CTGGCACGGT GACAACATGA TTGAGGCGAC CACCAACATG 15 0 
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CCGTGGTACA AGGGCTGGGA GAAGGAGACC 
TAAGACCCTT CTGGACGCTA TTGACGCCAT 
CCGACAAGCC TCTCCGTCTT CCTCTCCAGG 
ATCGGTACYG TCCCGGTCGG CCGTGTTGAG 
5 TATGGTTGTG ACCTTCGCTC CGTCGAACGT 
TCGAGATGCA CCACGAGCAG ATCCCTGAGG 
GGTTTCAACG TGAAGAACGT GTCGGTCAAG 
CGCCTCGGAC TCGAAGAACG ACCCGGCTCA 
CTCAGGTCAT TGTGATGAAC CACCCTGGTC 
10 CCRGTGCTCG ACTGCCACAC TGCTCACATT 
CCTCCAGAAG ATCGACCGTC GTTCGGGTAA 
AGTTCATCAA GTCGGGTGAC GCTGCCATGG 



AAGTCGGGCA AGGCCACTGG 2 00 

TGAGCCGCCG ACGCGCCCGA 25 0 

ATGTGTACAA GATCGGTGGT 30 0 

ACCGGTGTTA TCAAGCCCGG 3 50 

CACSACTGAA GTTAAGTCGG 4 00 

GTCTTCCGGG TGACAACGTT 4 50 

GACATTCGCC GTGGTAACGT 500 

GGAGGCTGCC TCGTTCAATG 550 

AGATCAGCAA CGGTTACTCG 600 

GCCTGCCGCT TCAACAACAT 65 0 

GGTTCTYGAA GAGAACCCCA 700 

TTGAGATGAT CCCGACCAA 74 9 



15 



2) INFORMATION FOR SEQ ID NO: 442 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1150 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malhranchea filamentosa 

(B) STRAIN: ATCC 48174 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442 

ACTGAAGGCC GAGCGTGAGC GTGGTATCAC CATCGATATC GCCCTCTGGA 50 

AGTTCGAGAC CCCCAAGTAC CATGTCACCG TCATTGGTAC GTTCGACATG 100 

TTCGACCTTT TGCCTAGTGT CCCCTTCTAA CCACAGTTTA TAGACGCCCC 150 

35 TGGCCATCGT GATTTCGTCA AGAACATGAT CACTGGTACT TCCCAGGCTG 2 00 

ATTGCGCTAT CCTCATCATT GCTTCCGGCA CTGGTGAATT CGAGGCTGGT 250 

ATCTCCAAGG ATGGCCAGAC CCGTGAGCAC GCTCTGCTTT CCTTCACCCT 300 

CGGTGTTAGG CAGC TCATTG TCGCCCTCAA CAAGATGGAC ACTGTCAACT 350 

TCGCTGAGGC CCGTTACAAC GAGATTGTCA AGGAAGTCTC CAACTTCATC 4 00 

40 AAGAAGGTCG GCTACAACCC CAAGGCCGTT CCTTTCGTCC CCATCTCCGG 450 

TTTCGAGGGT GACAACATGA TCGAGGC CTC CACCCGCATT CCTTGGTACA 500 

AGGGCTGGAA CAAGGAGACC GCCAGTGGCA AGAGCAC CGG CAAGACCCTY 550 

CTCGAGGCCA TTGATGC CAT CGAACCCCCG GTCCGTCCCA CCGACAAGCC 600 

CCTYCGTCTC C CTCTTCAGG ATGTGTACAA GATCTCCGGT ATTGGCACTG 650 

45 TTCCTGTCGG TCGTGTTGAG ACTGGTGTCA TCAAGCCTGG TATGGTCGTT 700 

ACTTTCGCCC CCGCCAACGT CACCACTGAA GTCAAGTCCG TCGAGATGCA 750 

CCACCAGCAG CTCCAGGCCG GTAACCCCGG TGACAACGTC GGCTTCAACG 800 

TCAAGAACGT TTCCGTCAAG GAAGTCCGCC GTGGCAACGT TGCCTCCGAC 850 

TCCAAGAACG ACCCCGCCAA GGGCTGCGAC TCCTTCAACG CCCAGGTCAT 90 0 

50 CGTCCTTAAC CACCCCGGTC AGGTCGGTGC TGGATACGCT CCCGTCCTCG 950 

ATTGCCACAC TGCCCACATT GCTTGCAAGT TCTCTGAGCT TCTTGAGAAG 1000 

ATCGATCGCC GTAC CGGTAA ATCCGTTGAG GACCACCCCA AGTTCATCAA 105 0 

GTCTGGTGAT GCCGCTATCG TCAAGATGGT TCCCTCCAAG CCTATGTGCG 1100 

TTGAGGCTTT CACTGACTAC CCTCCCCTTG GTCGTTTCGC CGTCCGTGAC 1150 

55 



60 



2) INFORMATION FOR SEQ ID NO: 443 
( i ) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 1099 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Metschnikowia pulchemma 

10 (B) STRAIN: DSM 70336 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443 

GGACAAGTTG AAGGCTGAGA GAGAGAGAGG TATCACCATC GACATTGCCT 50 

15 TGTGGAAGTT CGAGACTCCT AAGTACCACG TCACCGTYAT TGACGCCCCA 100 

" GGTCACAGAG ATTTCATCAA GAACATGATC ACTGGTACTT CCCAGGCTGA 150 

CTGTGCYATC TTGATTATCG CYGGTGGTGT TGGTGAGTTC GAGGCTGGTA 2 00 

TCTCCAAGGA TGGCCAGACC AGAGAGCACG CTTTGTTGGC YTACACCTTG 250 

GGTGTTAGAC ARTTGATTGT TGCCGTCAAC AAGATGGACT CTGTCAAGTG 3 00 

2 0 GGACAAGAAC AGATTTGAGG AGATCATCAA GGAGACCTCT AACTTCGTCA 350 

AGAAGGTTGG TTACAACCCT AAGACYGTGC CATTCGTGCC AATYTCTGGT 4 00 

TGGAACGGTG ACAACATGAT TGAGGCYTCC ACTAACTGCC CATGGTACAA 4 50 

GGGTTGGGAG AAGGAGAC C A AGGCCGGTAA GTCTWCCGGT AAGACCTTGT 500 

TGGAGGC CAT TGACGCCATT GAGCCACCAA CCAGACCTAC CGACAAGGCC 550 

2 5 TTGAGATTGC CTTTGCAGGA TGTCTACAAG ATCGGTGGTA TCGGAACGGT 6 00 

GCCAGTCGGC CGTGT C GAGA CCGGTGTCAT TAAGGCCGGT ATGGTTGTYA 650 

CCTTYGCCCC AGCTGGTGTC ACCACTGAGG TCAAGTCCGT CGAGATGCAC 7 00 

CACGAGCAGT TGGTCGAGGG TCTTCCAGGT GACAAYGTTG GTTTCAACGT 7 50 

CAAGAACGTC TCCGTTAAGG AGATCAGAAG AGGTAACGTC TGTGGTGACT 8 00 

3 0 CCAAGCAGGA CCCACCAAAG GGTGCCGCTT CTTTCACCGC YCAGGTTATT 8 50 

GTGTTGAACC ACCCTGGTCA GATCTCCTCT GGTTACTCTC CAGTGTTGGA 900 

CTGYCACACC GCCCACATTG CCTGTAARTT CGACACCTTG TTGGAGAAGA 9 50 

TTGACAGAAG AACTGGTAAG TCCTTGGAGT CYGAGCCTAA GTTCGTCAAG 10 00 

TCYGGTGACG CCGCCATTGT CAAGATGGTG CCAACCAAGC CAATGTGTGT 10 50 

3 5 TGAGGCTTTC ACCGACTACC CACCTTTGGG TAGATTCGCC GTCAGAGAC 1099 



2) INFORMATION FOR SEQ ID NO: 444 



40 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1153 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Paecilomyces lilacinus 

(B) STRAIN: ATCC 42570 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444 

5 5 CAAGCTCAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCCCTCT 50 

GGAAGTTCGA GACTCCCAAG TACTATGTCA CCGTCATTGG TACGTCGACT 10 0 

CGCGCGAGAC TGGTCGCAAT TTCCACGTCG CTAACGTGCT TGAACAGACG 150 

CTCCCGGCCA CCGTGACTTC ATCAAGAACA TGATCACTGG TACCTCCCAG 20 0 

GCTGACTGCG CTATCCTCAT TATCGCTGCC GGCACTGGTG AGTTCGAGGC 250 

6 0 TGGTATCTCC AAGGATGGCC AGACCCGTGA GCACGCTCTG CTCGCCTACA 3 00 
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CCCTCGGTGT TAAGCAGCTC ATCGTCGCTA TCAACAAGAT GGACACCACC 350 

SSgGTCTG AGGCCCGTTT CCAGGAGATC ATCAAGGAGA CCTCCAACTT 4 00 

^??SgSg ctSgCTACA ACCCCAAGAC CGTCGCTTTC GTCCCCATCT 450 

SgGTTTCCA CGGCGACAAC ATGCTTTCCC CCTCCACCAA CTGCCCCTGG 500 

5 GGGAGAAGGA GACCAAGGCT GGCAAGTCCA CCGGCAAGAC 550 

CCTCCTTGAG GCCATCGACT CCATCGAGCC CCCCAAGCGC CCCAGCGACA 600 

aScStCCG COTCCCCTT CAGGATGTGT ACAAGATCGG CGGTATCGGC 650 

A?AGTC?c5g 5cGGCCGTAT CGAGACTGGT GTCATCAAGC CCGGCATGGT 700 

PPTrACCTTC GCTCCTTCCA ACGTCACCAC CGAAGTCAAG TCCGTTGAGA 750 

10 SSS^SSS gSgCTCTCC GAGGGTGTCC CCGGTGACAA CGTCGGCTTC 800 

SStCAAGA ACGTCTCCGT CAAGGAGATC CGTCGTGGCA ACGTCGCCGG 850 

™SSm£ AACGACCCCC CTCTGGGTGC CGCTTCTTTC GATGCCCAGG 900 

?S?CGTCCT CAACCACCCC GGCCAGGTCG GTGCTGGCTA CGCCCCCGTC 950 

togactccc acaccgccca CATTGCCTGC AAGTTCGCCG AGATCAAGGA 1000 

15 SaAGATCGAC CGCCGTACCG GCAAGTCTGT CGAGTCCGCC CCCAAGTTCA 1050 

TCAAGTCTGG CGACTCTGCC ATCGTCAAGA TGATTCCCTC CAAGCCCATG 1100 

TGCGTTGAGG CTTTCACCGA CTACCCTCCT CTGGGCCGCT TCGCCGTCCG 1150 

TGA 1153 



20 

2 ) INFORMATION FOR SEQ ID NO: 445 

( i ) SEQUENCE CHARACTERI STICS : 
25 (A) LENGTH: 763 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 

(B) STRAIN: ATCC 32 071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445 

TACCACTAAG TGGTCCGAGA CCCGATTCAA TGAAATTATC AAGGAAGTCA 50 

CCAATTTCAT TAAGAAGGTC GGATATAACC CCAAGACTGT TCCTTTCGTT 100 

40 CCCATTTCTG GTTTCGAGGG TGACAACATG ATCGAGCCCT CTGCCAACTG 150 

CCCATGGTAC AAGGGCTGGT CCAAGGAGAC TGCTCAGGGC AAGTACTCTG 2 00 

GCAAGACCCT TCTTGAGGCC ATCGACGCCA TTGAGCCCCC CACCCGTCCT 2 50 

ACCGATAAAC CTCTCCGTCT TCCCCTCCAG GATGTCTACA AGATCTCCGG 3 00 

TATTGGCACT GTTCCTGTCG GACGTGTTGA GACTGGAGTC ATCAAGCCCG 350 

4 5 GTATGGTCGT GACCTTCGCT CCCGCCAACG TCACCACTGA AGTCAAGTCC 400 

GTCGAAATGC ACCACCAGCA GCTTACCGCC GGTAACCCCG GTGACAACGT 450 

CGGCTT CAAC GTCAAGAATG TTTCCGTCAA AGAAGTCCGC CGTGGTAACG 500 

TTGCCGGTGA CTCTAAGAAT GATCCCCCAA AGGGCTGCGA TTCCTTCAAT 550 

GCCCAGGTCA TCGTCCTCAA CCACCCTGGT CAGGTTGGCG CTGGTTATGC 6 00 

50 CCCAGTCCTC GACTGCCATA CTGCCCACAT TGC CTGCAAA TTCGCTGAGC 650 

TCCTTGAGAA GATTGATCGA CGAACCGGAA AGTCTGTTGA GAACAACCCC 700 

AAGTTCATCA AGTCCGGTGA TGCTGCTATC GTCAAGATGA TTC CTTCCAA 750 

GCCCATGTGC GTC 763 



55 

2) INFORMATION FOR SEQ ID NO: 446 

(i) SEQUENCE CHARACTERISTICS: 
6 0 (A) LENGTH: 134 6 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicilllum marneffei 

(B) STRAIN: ATCC 64101 

10 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446 

AAGCTCAAGG CTGAGCGTGA GCGTGGTATC ACCATCGATA TTGCTCTCTG 50 

GAAGTTCCAG ACTGCCAAGT ACGAGGTTAC CGTCATTGAC GCCCCCGGTC 100 

15 ACCGTGATTT CATCAAGAAC ATGATCACTG GTACCTCCCA GGCCGATTGC 15 0 

GCTATTCTCA TCATTGCCTC TGGTACTGGT GAATTCGAGG CTGGTATCTC 2 00 

CAAGGATGGC CAGACTCGTG AGCACGCTCT TTTGGCTTTC ACCCTCGGTG 250 

TCCGTCAGCT CATTGTTGCC CTCAACAAGA TGGACACTTG CAAGTGGTCT 300 

CAGGGTGAGT ACTCGTACCT GCGTTTGGCC TTGAATATCT TACTAATGCA 350 

2 0 C CATAGATCG TTACAACGAA ATTGTCAAGG AGACTTCCAA CTTCATCAAG 400 

AAGGTCGGAT ACAACCCCAA GAACGTTCCT TTCGTTCCTA TCTCCGGTTT 450 

CAACGGTGAC AACATGCTTG AGCCCTCCCC CAACTGCCCC TGGTACAAGG 500 

GTTGGGAGAA GGAGAC CAAG GCCGGTAAGG TCACTGGTAA GACCCTCCTC 550 

GAGGC CATCG ACGCCATTGA GCCCCCTACC CGTCCCGCCA ACAAGGT TAG 600 

25 TCCCTCCTCG ACTACTCAAA CCCTCCTCAT AAGTTCAGAT TACTGACTCG 650 

TTCACAGCCC CTCCGTCTTC CCCTCCAGRA TGTCTACAAG ATCGGTGGTA 70 0 

TTGGAACGGT TCCCGTCGGT CGTGTTGAGA CTGGTACCAT CGTTCCTGGT 75 0 

ATGGTTGTCA CCTTGTAAGT CACTCTCCTC GCTTATCCTA CCTGAAATCA 8 00 

TCATGTGCTA ACTTGACACT CAGCGCTCCC GCCAACGTCA CCACTGAAGT 8 50 

30 CAAGAGTGTT GAAATGCACC ACCAGCAGCT CACTGCCGGT CAGCCCGGTG 900 

ACAACGTTGG TTTCAACGTG AAGAACGTCT C CGTCAAGGA AATCCGTCGT 950 

GGTAACGTTG CTGGTGACAG CAAGAACGAC CCCCCTGCCG GTGCTGCCTC 1000 

CTTCAACGCC CAGGTCATCG TCCTCAACCA CCCCGGTCAG GTCGGTGCTG 1050 

GTTACGCCCC AGTCCTCGAT TGCCACACTG CCCACATTGC TTGCAAGTTC 1100 

3 5 GCTGAGCTCC TCGAGAAGAT TGACCGTCGT AC CGGAAAGT CTGTTGAGGA 1150 

CCACCCCAAG TTCATCAAGT CCGGTGACGC TGCCATCGTC AAGATGATTC 12 00 

CTTCCAAGCC CATGTGTGTT GAGGCTTTCA CCGAGTACCC TCCTCTCGGT 1250 

CGTTTCGCCG TTCGCGAGTA AGTTTTATCT CCGTTGTCTA TTTTCCATCC 1300 

TTCCCTTCTC CTCCGTCTTC CATATATACT TTTTCAGTTA TATGTG 134 6 

40 



2) INFORMATION FOR SEQ ID NO: 447 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1094 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Plchia anomala. 
55 (B) STRAIN: ATCC 18205 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447 

AAGTTAAAAG CTGAACGTGA AAGAGGTATC ACTATTGATA TTGCTTTATG 50 
60 GAAATTCGAA ACTCCAAAAT AC CATGTTAC CGTTATTGAT GCTCCAGGTC 100 
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araraPATTT CATCAAAAAT ATGATTACTG GTACTTCCCA AGCTGATTGT 150 

2SSS!E£ SSSccSS TGGTATTGGT GAATTCGAAG CTGGTATCTC 200 

SliSrrGT CAAACCAGAG AACACGCTTT NTTAGCTTAC ACCTTAGGTG 25 0 

GATOGTTGCT ATCAACAAGA TGGATTCCGT ^ATGGGAT 300 

. ?I???TCGTT TCGAAGAAAT TGT C AAGGAA ACYTCAAACT TTATCAAGAA 3 50 

^SSc AACCCAAAAA CTGTTCCATT CGTTCCAATC TCAGGTTGGA 400 

ISSotSSS CATGATTGAA CCATCAWCTA ACTGTCCATG GTACAAAGGT 450 

TOgSSSS AAACCAAAGC TGGTGAAGCT AAAGGTAAAA CTTTATTAGA 500 

SSotSSt SctATTGACC CACCATCAAG ACCAACTGAT AAACCATTAC 550 

10 S?tScCMT ACAAGATGTT TACAARATTG GTGGTATTGG TACTGTGCCA 600 

GTOgSaSg TTGAAACCGG TGTTATCAAA CCAGGTATGG TTGTTACCTT 650 

TOOTCAGCT GGTGTTACCA CTGAAGTCAA ATCTGTTGAA ATGCATCATG 700 

S^SSS? TGAAGGTTTA CCAGGTGACA ATGTTGGTTT CAACGTTAAG 750 

AATGTTTCTG TTAAAGAAAT CCGTCGTGGT AACGTCTGTG GTGACTCTAA 800 

15 SSSSScS CCAAAAGCTG CTGAATCATT CAATGCTCAA GTTATTGTCT 850 

^CCATCC AGGTCAAAT C TCTGCTGGTT ACTCTCCAGT TTTAGATTGT 900 

CACACTGCTC ACATTGCTTG TAAATTTGAC ACTTTAATTG AAAAAATTGA 950 

CAGACGTACT GGTAAGAAAT TAGAAGAAGC TCCAAAATTC ATCAAATCAG 1000 

GTGATGCTGC TATTGTTAAA TTTGTTCCAT CAAAACCATT ATCAGTTGAA 1050 

2 0 GCTTTCACTG ACTACCCACC ATTAGGTCGT TTCGCTGTCA GAGA 10 94 



2) INFORMATION FOR SEQ ID NO: 448 

25 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1100 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
3 5 (A) ORGANISM: Pichla anomala 

(B) STRAIN: ATCC 2149 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448 

40 CTTAGATAAG TTAAAAGCTG AACGTGAAAG AGGTATCACT ATTGATATTG 5 0 

CTTTATGGAA ATTCGAAACT CCAAAATACC ATGTTACCGT TATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAAAATATG ATTACTGGTA CTTCCCAAGC 150 

TGATTGTGCT ATMTTAATTA TTGCCGGTGG TATTGGTGAA TTCGAAGCTG 200 

GTATCTCAAA AGATGGTCAA ACCAGAGAAC ACGCTTTRTT AGCTTACACC 2 50 

45 TTAGGTGTTA AACAATTGAT TGTTGCTATC AACAAGATGG ATTCCGTTAA 3 00 

ATGGGATGAA TCTCGTTTCG AAGAAATTGT CAAGGAAACY TCAAACTTTA 350 

TCAAGAAAGT TGGTACAACC CAAAAACTGT TCCATTCGTT CCAATCTCAG 400 

GTTGGAATGG TGATAACATG ATTGAACCAT CAACTAACTG TCCATGGTAC 45 0 

AAAGGTTGGA AAAAAGAAAC CAAAGCTGGT GAAGCTAAAG GTAAAACTTT 500 

50 ATTAGAAGCC ATTGATGCTA TTGATCCACC ATCAAGACCA ACTGATAAAC 550 

CATTACGTTT ACCATTACAA GATGTTTACA ARATTGGTGG TATTGGTACT 600 

GTGCCAGTCG GTAGAGTTGA AACCGGTGTT ATCAAACCAG GTATGGTTGT 650 

TACCTTTGCC CCAGCTGGTG TTACCACTGA AGTCAAATCT GTTGAAATGC 700 

ATCATGAACA ATTGACTGAA GGTTTACCAG GTGACAATGT TGGTTTCAAC 7 50 

55 GTTAAGAATG TTTCTGTTAA AGAAATCCGT CGTGGTAACG TCTGTGGTGA 800 

CTCTAAAAAC GATCCACCAA AAGCTGCTGA ATCATT CAAT GCTCAAGTTA 850 

TTGTCTTAAA CCATCCAGGT CAAATCTCTG CTGGTTACTC TCCAGTTTTA 900 

GATTGT CACA CTGCTCACAT TGCTTGTAAA TTTGACACTT TAATTGAAAA 950 

AATTGACAGA CGTACTGGTA AGAAATTAGA AGAAGCTCCA AAATTCATCA 1000 

60 AATCAGGTGA TGCTGCTATT GTTAAATTTG TTCCATCAAA ACCATTATCA 1050 

403 



CA 02283458 1999-09-28 



GTTGAAGCTT TCACTGACTA CCCACCATTA GGTCGTTTCG CTGTCAGAGA 110 0 



5 2) INFORMATION FOR SEQ ID NO: 449 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 01 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudallescheria boydll 

(B) STRAIN: ATCC 44331 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449 



CAAGCTCAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCCTCT 50 

GGAAGTTCGA GACCCCCAAG TACCAGGTCA CCGTCATTGG TATGTC TTTG 10 0 

TGCTTTTTGT GCTTTTTGTG CCTGTGCCTC GCACAATTCC AGCCCTCGAT 15 0 

AATTATGAAC CTCGTACTAA TATGTCGTTC TCCCACTACC CACAGACGCC 200 

2 5 CCCGGCCATC GTGATTTCAT CAAGAACATG ATTACTGGTA CCTCCCAGGC 250 

TGATTGCGCC ATTCTCATCA TTGCCTCCGG TACTGGTGAG TTCGAGGCTG 3 00 

GCATCTCCAA GGATGGCCAG ACCCGTGAGC ACGCTCTTCT CGCCTTCACC 350 

CTCGGTGTCA AGAACCTCAT TGTTGCCATC AACAAGATGG AC AC CAACAA 4 00 

CTGGTCCGAG GAC CGATAC A AGGAGATCAT CAAGGAGACC TCCAACTTCA 450 

3 0 TCAAGAAGGT CGGCTACAAC CCCAAGGCCG TTCCTTTCGT CCCCATCTCC 500 

GGTTTCCACG GAGACAACAT GCTTACCCCC TCCACCAACT GCCCCTGGTA 550 

CAAGGGTTGG GTCCGTGAGG TCAAGGGTAA CACCCTTACC GGCAAGACCC 600 

TTCTCGAGGC CATCGACTYC ATCGAGCCCC CCAAGCGTCC CACCGAGAAG 650 

CCCCTCCGTC TTCCCCTTCA GGACGTCTAC AAGAT CGGTG GTATTGGCAC 70 0 

3 5 TGTGCCCGTC GGCCGTATCG AGACCGGTAT CCTCAAGCCC GGTATGGTCG 75 0 

TCACCTTCGC TCCCTCCAAC GTCACCACTG AAGTCAAGTC CGTCGAGATG 800 

CACCACGAGC AGCTTACCGA GGGTGTCCCC GGTGACAACG TTGGTTTCAA 850 

CGTGAAGAAC GTCTCCGTCA AGGATATCCG CCGTGGCAAC GTCTGCGGTG 900 

AC TCCAAGAA CGACCCCCCC GCTGCTGCCG CCTCTTTCCA GGCCCAGGTC 950 

4 0 ATTGTCCTCA ACCACCCCGG CCAGATCGGT GCTGGTTACG CTCCCGTTCT 1000 

TGACTGCCAC ACTGCCCACA TTGCTTGCAA GTTCGCCGAG CTCCTTGAGA 1050 

AGATCGAC C G CCGTACCGGT AAGTCGGTCG AGAACAACCC CAAGTTCGTC 1100 

AAGTCTGGTG ATGCCGCCAT CGTCAAGATG GTTCCCTCCA AGCCCATGTG 1150 

TGTTGAGTCC TTCACCGAGT ACCCCCCTCT CGGTCGTTTC GCCGTCCGTG 1200 

45 A 1201 



2) INFORMATION FOR SEQ ID NO: 4 50 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 95 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Rhizopus oryzae 
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(B) STRAIN: ATCC 56015 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450 

s TRACCTTAAG GCTGAACGTG AACGTGGTAT CACCATCGAT ATTGCTCTCT 50 

Sa1ot?c^a AACCCCCAAG TACCAAATTA CCGTTATTGA TGCTCCCGGT 100 

SSSSSSt tHtCAAGAA CATGATTACC GGTACTTCTC AAGCCGATTG 150 

tcSattctt ATCATTGCTG GTGGTACTGG TGAATTCGAA GCTGGTATCT 200 

?cSSg!?GG TCAAACCCGT GAACACGCCC TTTTGGCTTT CACTCTCGGT 250 

TO TGATTGTTGC TGTCAACAAG ATGGATACCA CCAAGTGGT C 300 

CGAAGCTCOT TTCAACGAAA TYGTCAAGGA AGTTTCTTCC TTCATCAAGA 350 

arATTGGTTA CAACCCCAAG TCTGTTCCCT TCGTCCCCAT CTCTGGTTGG 4 00 

SSSSSS ACATGTTGGA AGAATCTACC AACATGCCCT GGTACAAGGG 450 

ATGGAACAAG GAAACCAAGG CTGGTGCCAA GTCTGGTAAG ACCCTCTTGG 500 

ATGCCATTGA CAACATTGAC CCTCCTACCC GTCCTGTTGA CAAGCCTCTC 550 

CGTCTTCCTC TTCAAGATGT TTACAAGATT GGTGGTATCG GTACTGTCCC 600 

CGTCGGTCGT GTCGAAACTG GTGTCATCAA GGCTGGTATG GTTGTCACCT 650 

?CG??CCtSc TGCTGTCACC ACTGAAGTTA AGTCCGTCGA AATGCACCAC 700 

GAAACCCTCA CTGAAGGTCT CCCCGGTGAC AACGTCGGTT TCAACGTCAA 750 

GAACGTCTCC GTCAAGGATA TCCGTCGTGG TAACGTCTGT TCTGACTCCA 800 

AgIScGACCC CGCCAAGGAA GCCGGTTCCT TCACCGCTCA AGTCATTATC 850 

Jt^CCACC CTGGTCAAAT TGGTGCTGGT TACGCTCCYG TTTTGGATTG 900 

TCACACTGCT CACATTGCCT GTAAGTTCGC TGAATTGATC GAAAAGATTG 950 

ACAGACGTTC CGGTAAGTCC TTGGAAGCTA CTCCCAAGTT CGTCAAGTCT 1000 

25 GGTGACTCTG CCATCGTCAA GATGATCCCC TCCAAGCCCA TGTGTGTTGA 1050 

AGCTTACACT GACTACCCTC CTCTCGGTCG TTTCGCTGTT CGTGA 1095 



30 2) INFORMATION FOR SEQ ID NO: 451 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1092 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhodotorula minuta 

(B) STRAIN: ATCC 10658 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451 



GCTGAAGGCC GAGCGAGAGC GTGGTATCAC CATCGATATC GCTCTATGGA 50 

AGTTCGAGAC CCCCAAGTAC AACGTCACCG TCATTGACGC TCCAGGACAT 100 

CGTGATTTCA TCAAGAGTGA GTTAAC CATA ACATCAAACA GTGTTGCAAA 150 

CATCAGCTAA TGCATGTTAT GCGTCCAGAC ATGATTACTG GTACTTCCCA 200 

50 GGCCGATTGC GCTATTCTCA TCATCGCCAC CGGTGTTGGT GAGTTCGAGG 250 

CTGGTATCTC CAAGGATGGC CAGACCCGAG AGCACGCCCT TCTCGCCTTC 3 00 

ACCCTCGGTG TCAGACAGCT CATCGTTGCC TTGAACAAGA TGGACTCGGT 350 

CAAGGTAGGC TAACTTCACA ACGTCGGCTT CCCATCATTC ATTCACTTAC 4 00 

CTGTCTTGTC TTCCACCCTC CAGTTCTCCG AGTCCCGATA CGATGAAATC 450 

55 GTCAAGGAGA CATCCGGTTT CATCAAGAAG GTCGGATTCG ACCCCAAGGG 50 0 

TGTTCCCTTC GTCCCCATCT CAGGATGGCA CGGAGACAAC ATGATCGAGG 550 

AGTCCACCAA CATGCCTTGG TACAAGGGAT GGAAGAAGAC CACCAAGACC 600 

GGCGAGTACA AGGGAAAGAC CCTGCTCGAG GCCATCGACT CCATCGAGCC 650 

CCCCACCCGT CCTACCGACA AGCCTCTCCG ACTTCCCCTC CAGGATGTCT 700 

60 ACAAGATTGG TGGTATCGGA ACAGTGCCAG TCGGACGAGT CGAGACTGGT 750 
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AC CAT C AAGG CTGGTATGGT 
CGAAGTCAAG TCTGTTGAGA 
CAGGTGACAA CGTCGGATTC 
CGAAGAGGAA ACGTCTGCGG 
5 TGCTTCCTTC ACCGCCCAGG 
GTAACGGATA CTCTCCAGTT 
CATATTAGTT TGAACTCTTT 
TTCAGCCCAC ATTGCATGCA 
GACGATCCGG AAAGTCCATC 
10 GACGCCGCCA TCGTCAAGAT 
TTTCACCGAC TACCCACCTC 



CGTCACCTTC GCTCCTTCAG 
TGCACCACGA GCAGCTCGAG 
AACAT CAAG A ACGTTTCAGT 
TGACTCCAAG AACGATCCCC 
TCATTGTCCT CAACCACCCC 
CTCGATTGCC ACACTGGTGA 
TGAACAATAC TAACGTGAAT 
AGTTCGACAC CCTCCTAGAG 
GAAGATACCC CCAAGTTCGT 
GGTCCCCACC AAGCCAATGT 
TTGGACGATT CGCCGTCCGT 



CTGTCACCAC 800 

GCTGGTCTTC 850 

CAAGGATATC 900 

CCAAGGAGGC 95 0 

GGTCAAATCG 100 0 

GTCATTCTTC 10 5 0 

CATTATACTT 1100 

AAGATTGACC 1150 

CAAGTCTGGT 12 00 

GCGTTGAGGC 12 50 

GA 1292 



15 2) INFORMATION FOR SEQ ID NO: 452 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 128 9 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: SporoholomyceB salmonicolor 

(B) STRAIN: ATCC 32311 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452 



AAGCT C AAGG CCGAGCGTGA GCGTGGTATC ACCATCGATA TCGCTCTCTG 50 

GAAGTT CGAG ACCCCCAAGG TGCGTTCTCA CCCCGGCTGA GGAGCACGCA 10 0 

CGCGAGGGCT CACGCTGCGC CTCTTACAGT ACATGATCAC CGTCATCGAC 150 

GCCCCGGGTC ACCGTGACTT CATCAAGAAC ATGATCACCG GTACCTCGCA 200 

3 5 GGCCGACTGC GCCATCCTCA TCATCGCCGC CGGTACCGGT GAGTT CGAGG 250 

CTGGTATCTC GAAGGACGGC CAGACCCGCG AGCACGCCCT CCTCGCCTTC 300 

ACCCTCGGTG TCCGTCAGCT CATCGTCGCC ATCAACAAGA TGGACACGAC 3 50 

CAAGTACTCG GAGGCCCGTT TCGAGGAGAT CATCAAGGAG ACCTCCAACT 400 

TCATCAAGAA GGTCGGCTTC AACCCCAAGA ACGTCCCCTT CGTCCCCATC 4 50 

4 0 TCGGGATGGC ACGGTGACAA CATGATTGAG GAGACCGCCA ACATGCCCTG 500 

GTACAAGGGA TGGAAGAAGG AGACCAAGGC CGGTGAGGTC AAGGGCAAGA 550 

CCCTCCTCGA CGCCATCGAC GCGATCGAGC CCCCTTCGCG CCCTACCGAC 600 

AAGCCCCTCC GTCTTCCCCT CCAGGTTCGT TTCCCTGCTC GCGGTTTACG 650 

CTGCTACTTC GAGCTGACCC GCGAGCTCTG CCCGAACAGG ATGTCTACAA 700 

45 GATCGGTGGT ATCGGCACAG TCCCCGTCGG CCGTGTCGAG ACCGGCACGA 750 

TCAAGGCCGG TATGGTCGTC GTCTTCGCCC CGGCCAACGT CACCACTGAG 800 

GTCAAGTCGG TCGAGATGCA CCACGAGCAG CTCGAGGCTG GTCTCCCGGG 850 

AGACAACGTC GGCTTCAAGT GCGTCTCATC ATGTTTTTGC TTCGCTCGGC 900 

CATTTTTTCA GTCCTGACCC CGTTTTGCCC CTCGACAGCG TCAAGAACGT 950 

50 TTCC GTTAAG GACATCCGTC GCGGTAACGT CTGCGGTGAC TCGAAGAACG 1000 

ACCCCCCCAA GGAGGCCGCT TCCTTCAAGG CCCAGGTCAT CGTCATGAAC 1050 

CACCCCGGTC AGATCGGCAA CGGTTACGCT CCCGTTCTCG ACTGCCACAC 1100 

CGCCCACATT GCCTGC AAGT TCGACACCCT CCTCGAGAAG ATCGACCGTC 1150 

GCTCGGGCAA GTCGATTGAG GACCTCCCCA AGTTCGTCAA GTCGGGTGAC 12 0 0 

55 GCCGCCATCG TCAAGATGGT TCCCTCCAAG CCGATGTGTG TCGAGTCGTT 1250 

CGCCGAGTAC CCCCCTCTCG GACGTTTCGC CGTCCGTGA 1289 



60 2) INFORMATION FOR SEQ ID NO: 453 
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20 



25 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1070 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: , , 

(A) ORGANISM: Sporothrix schenckn 

(B) STRAIN: WSA-148 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453 

GTGAGCGCGG T ATC AC CAT C GATATTGCTC TGTGGAAGTT 
AAGTACTACG TCACCGTCAT TGACGCCCCC GGTCATCGCG 
GAACATGATC ACTGGTACCT CGCAGGCCGA CTGCGCCATT 
CCGCTGGTAC TGGTGAGTTC GAGGCTGGTA TCTCCAAGGA 
CGTGAGCACG CTCTGCTCGC CTACACCCTG GGTGTGCGGC 
CGCCATCAAC AAGATGGACA CGGCCAAGTG GGCTGAGGCT 
AGATCATCAA GGAGACCTCC AAC TTCATCA AGAAGGTCGG 
AAGACTGTTG CCTTCGTCCC CATCTCGGGC TTCCACGGCG 
TACTCCCTCG ACCAACTGCC CCTGGTACAA GGGCTGGGAG 
AGAGCGGCAA GGTTACCGGT AAGACTCTGC T GGACGC CAT 
GAGCCCCCCA AGCGCCCCAC GGACAAGCCC CTGCGTCTGC 
TGTCTACAAG ATCGGCGGTA TCGGCACTGT CCCTGTCGGC 
CTGGTGTCCT GAAGCCCGGC ATGGTCGTCA CCTTTGCCCC 
AC CACTGAAG TCAAGTCCGT CGAGATGCAC CACGAGCAGC 
TGTTCCCGGC GACAACGTCG GCTTCAACGT CAAGAACGTC 
AGATCCGTCG TGGCAACGTT GCCGGTGACT CCAAGAACGA 
GGCGCCGCCA CCTTCAACGC CCAGGTCATT GTCCTGAACC 
GGTCGGCAAC GGCTACGCCC CGGTTCTGGA CTGCCACACC 
CCTGCAAGTT C AC CGAGATC CTTGAGAAGA TCGACCGCCG 
TCGGTTGAGA ACAACCCCAA GTTCATCAAG TCGGGTGACG 
CAAGC TGACG CCCTCGAAGC CCATGTGCGT TGAGGCCTTC 
CCCCTCTGGG CCGTTTCGCC 



CGAGACCCCC 50 

ATTTCATCAA 10 0 

CTCATCATTG 150 

TGGCCAGACT 200 

AGCTGATCGT 250 

CGTTACCAGG 300 

CTACAACCCC 3 50 

ACAACATGCT 400 

AAGGAGGGCA 450 

TGACGC CGTC 500 

CCCTCCAGGA 550 

CGTATCGAGA 600 

GTCCAACGTC 650 

TTGTTGAGGG 700 

TCCGTCAAGG 7 50 

CCCCCCCTCG 800 

ACCCCGGCCA 8 50 

GCCCACATTG 900 

TACCGGCAAG 950 

CCGCCATTGT 10 0 0 

ACTGACTACC 1050 
1070 
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2) INFORMATION FOR SEQ ID NO: 454 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1092 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

( vi ) ORIGINAL SOURCE : 

(A) ORGANISM: Stephanoascus ciferrii 

(B) STRAIN: ATCC 52550 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 454 

CTTAAGTCTG AGCGTGAGAG AGGTATCACC ATCGATATTG CTCTCTGGAA 50 

ATTCGAGACT CCTAAGTACA ACGTTACCGT CATTGATGCT CCAGGTCACA 100 

GAGATTTCAT CAAGAACATG ATTACTGGTA CCTCCCAGGC CGATCTTGCC 150 

60 ATCCTTATCA TTGCTGGTGG TGTCGGTGAG TTCGAGGCTG GTATCTCCAA 2 00 
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GGACGGTCAG A CC AG AGAGC ACGCTCTTCT TTCTTTCACC CTTGGTGTCA 250 

GAAACATGAT TGTTGCTGTC AACAAGATGG ACTCCGTCAA GTGGTCTGAG 3 00 

GATCGTTTCA ACGAAATTGT CAAGGAGACC TCCAACTTCG TCAAGAAGGT 35 0 

TGGTTACAAC CCTAAGAATA TTGCTTTCGT TCCTATCTCC GGTTGGAACG 4 00 

5 GTGACAATAT GATTGAGCCA TCCACCAACT GCCCATGGTA CAAGGGTTGG 4 50 

GAGCGTGAGA CCAAGAACGG TACTGCCAAG GGTAAGACCA TCTTGGAGGC 500 

CATTGACTCT ATGGAGCCAC CTTCCAGACC AGTTGACAAG CCTCTCCGTC 550 

TTCCTCTTCA GGACGTTTAC AAGATTGGTG GTATTGGTAC GGTGCCAGTT 600 

GGTCGTGTTG AGACTGGTGT TATCAAGCCA GGTATGGTTG TTACCTTTGC 65 0 

10 CCCAGCTGGT GTCACCACTG AAGTCAAGTC TGT CGAGATG CACCACGAAC 700 
AGATCCCAGA AGGT AC CC CA GGTGACAACG TTGGTTTCAA CGTCAAGAAC 750 
GTCTCCGTCA AGGAAATCAG ACGTGGTAAC GTTACTGGTG ACTC CAAGAA 8 00 

CGACCCACCA AAGGGCTGCG ACTCTTTCAA CGCTCAGGTC ATCATCTTCA 850 

ACCACCCTGG TCAGATCTCT GCTGGTTACG CTCCAGTTTT GGACTGCCAC 900 

15 ACTGCTCACA TTGCTTGCAA GTTTGAGGAG CTCATTGAGA AGATTGACAG 95 0 

ACGTTCCGGT AAGAAGGT CG AAGACTCTCC TAAGTTCGTC AAGGCCGGTG 1000 

ATGCCGCCAT TGTCAAGATG GTTCCATCCA AGCCAATGTG TGTTGAAACC 1050 

TTCACTGAGT ACCCACCTCT TGGTCGTTTC GCCGTCCGTG AC 1092 
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2) INFORMATION FOR SEQ ID NO: 455 



(i ) SEQUENCE CHARACTERISTICS : 
25 (A) LENGTH: 1149 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophyt.es 

(B) STRAIN: WSA-225 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455 



GCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCCTCT GGAAGTT CGA 50 

GACCCCCAAG TACAATGT CA CCGTCATTGG TATGTTTTCT CTTTACCTTT 100 

4 0 CCCCTCCATC GTCTTGCTGT GCCATAACTA ACGAGAGTAG ACGCCCCCGG 150 

TCACCGTGAC TTCATCAAGA ACATGATCAC TGGTACCTCC CAGGCCGACT 200 

GTGCTATTCT CATCATTGCT GCCGGTACTG GTGAGTTCGA GGCTGGTATC 250 

TCCAAGGATG GCCAGACCCG TGAGCACGCT CTGCTCGCCT TCACCCTTGG 3 00 

TGTCAAGCAG CTCATCGTTG CCATCAACAA GATGGACACC ACCAACTGGT 3 50 

45 CCGAGGACCG TTTCAAGGAA ATCATCAAGG AAGTCACCAA CTTCATCAAG 4 00 

AAGGTTGGCT ACGACCCCAA GGGTGTTCCA TTCGTTCCAA TCTCTGGTTT 450 

CAACGGTGAC AACATGATTG AGGCCTCCAC CAACTGCCCA TGGTACAAGG 500 

GATGGAACAA GGAGACTAAG GCCGGTGGTG CCAAGACTGG CAAGACCCTC 550 

CTCGAGGCCA TCGATGCCAT CGACATGCCA ACCCGTCCTA CCGACAAGCC 600 

50 CCTCCGTCTC CCACTCCAGG ATGTCTACAA GATCTCTGGT ATCGGAACTG 650 

TACCAGTCGG TCGTGTTGAG ACCGGTATCA TCAAGCCCGG TATGGTCGTC 700 

ACCTTCGCCC CTGCCAACGT CACCACTGAA GTCAAGTCCG TCGAAATGCA 750 

CCACCAGCAG CTTCAGCAGG GTGTCCCCGG TGACAACGTC GGCTTCAACG 8 00 

TCAAGAACGT TTCCGTCAAG GAAGTCCGCC GTGGTAACGT TGCCGGTGAC 8 50 

55 TCCAAGAACG ACCCACCATC CGGCTGTGCC TCCTTCAACG CCCAGGTCAT 900 

CGTCCTCAAC CACCCCGGCC AGATCGGTGC TGGTTACGCC CCAGTCCTCG 95 0 

ACTGCCACAC TGCTCACATT GCTTGCAAGT TCGCTGAGCT CCTCGAGAAG 1000 

ATTGACCGCC GTAC CGGTAA ATCCGTCGAA GCCAACCCCA AGTTCGTCAA 1050 

GTCTGGTGAT GCCGCTATCG CCAAGATGGT TCCCTCCAAG CCTATGTGCG 110 0 

60 TTGAGGCTTT CACTGACTAC CCCCCACTTG GTCGTTTCGC CGTCCGTGA 114 9 
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2) INFORMATION FOR SEQ ID NO: 456 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Trichosporon cutaneum 

(B) STRAIN: ATCC 62965 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456 

9 0 TCTTGACAAG CTTAAAGCTG AACGTGAACG TGGTATCACC ATTGATATCG 50 
20 TCTTGALAAb blii^VT^m npm^r.T^T arrzTTAPTfiT TATTGATGCT 100 



GTTCGAAACT CCTAAGTACT ACGTTACTGT TATTGATGCT 

SaGCTCACC GTGATTTCAT CAAGAACATG ATTACTGGTA CTTCCCAAGC 150 

c^ctgcqcc ATTCTTATCA TTGCTGCCGG TGTCGGTGAA TTCGAAGCTG 200 

SSScTCoS GGAAGGTCAA ACCAGAGAAC ACGCTCTTCT CGCTTTCACC 250 

^TTGGTGTCA GACAACTTAT CATTGCCATC AACAAGATGG ACTCTGTCAA 300 

SSSSSS* SgAGATACG AAGAAATCGT CAAGGAGGCT TCCAACTTCG 350 

SSSSSSt TGGTTACAAC CCCAAGTCTG TTCCATTCGT TCCTATCTCT 400 

SStTGGAACG GTGACAACAT GTTGGAACCT ACCACCAACG CCCCATGGTA 450 

caIgggatgg ACCAAGGAAA CCAAGGCTGG TGCCACTAAG GGTATGACTC 500 

TTATTGAAGC CATTGACGCC ATTGAACCAC CAGTAAGACC ATCCGACAAG 550 

CCACTCCGTC TCCCACTCCA AGATGTTTAC AAGATTGGTG GTATCGGAAC 600 

TGTGCCAGTC GGCCGTGTCG AAAC CGGTAT CATCAAGGCC GGTATGGTCG 650 

TCACCTTTGC TCCACCAATG GTCACAACTG AAGTTAAGTC CGTTGAAATG 700 

CACCACGAAC AACTTGCTCA AGGTAACCCA GGTGACAACG TTGGTTTCAA 750 

CGTCAAGAAC GTTTCCGTTA AGGAAATCAG ACGTGGTAAC GTCTGTGGTG 800 

ACT CCAAGAA CGATCCACCA AAGGGCTGCG AATCTTTCAA CGCTCAAGTT 850 

ATCGTCTTGA ACCACCCTGG TCAAATCTCT GCTGGTTACT CTCCAGTTCT 900 

CGATTGC CAC ACTGCCCACA TTGCCTGCAG ATTCGACGAA CTCCTTGAAA 950 

AGATCGACCG TCGTTCCGGT AAGAAGATTG AAGACTCTCC AAAGTTTGTC 1000 

AAGTCTGGTG ATGCCGCTAT CGTCAAGATG ATCCCAACCA AGC CAATGTG 1050 

CGTTGAAACC TTCACTGAAT ACCCACCACT TGGT CGTTTC GCCGTCCGTG 1100 



45 

2) INFORMATION FOR SEQ ID NO: 457 

{ i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1085 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Wangiella dennatitidis 

(B) STRAIN: WSA-229 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457 
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GTTGAAGGCC GAGCGTGAGC GTGGTAT C AC CAT CGATATC GCCCTCTGGA 50 

AGTTCGAGAC CCCCAAGTAC TATGTCACCG TCATCGACGC CCCGGGTCAT 10 0 

CGTGACTTTA TCAAGAACAT GATCACTGGT ACCTCGCAGG CCGACTGCGC 15 0 

5 CATCTTGATC ATTGCCGCCG GTACCGGTGA ATTCGAAGCC GGTATCTCCA 200 

AGGATGGTCA GACCCGTGAG CACGCTCTGC TCGCCTACAC CTTGGGTGTC 250 

AAGCAGCTCA TCGTCGCCAT CAACAAGATG GACACCACCA AGTGGTCCGA 300 

GGAGCGTTTC AACGAAATCA TCAAGGAGAC TTCCAACTTC ATCAAGAAGG 3 50 

TCGGCTACAA CCCCAAGGCC GTTCCTTTCG TCCCCATCTC CGGCTTCAAC 4 00 

10 GGTGACAACA TGATTGAGGT CTCCACCAAC TGCCCGTGGT ACAAGGGATG 450 

GGAGAAGGAG TCCAAGGCTG GCAAGGCCAC CGGCAAGACC CTCCTCGAGG 50 0 

CCATTGACGC CATCGACCCA CCCACCCGTC CCACCGACAA GCCTCTCCGT 550 

CTCCCTCTCC AGGATGTCTA CAAGATCTCT GGTATC GGAA CGGTTCCTGT 600 

CGGTCGTGTC GAGAC CGGTA CCATCAAGGC CGGTATGGTC GTCACCTTCG 650 

15 CTCCGGCCAA CGTCACCACT GAAGTCAAGT CCGTCGAAAT GCACCACGAG 70 0 

CAGCTCGCCG AGGGTCTGCC AGGTGACAAC GTTGGCTTCA ACGTCAAGAA 750 

CGTCTCCGTC AAGGAGGTTC GTCGTGGTAA CGTTGCCGGT GACTCCAAGA 8 00 

ACGACCCGCC CAAGGGTGCC GAGTCCTTCA ACGCCCAGGT CATTGTCCTC 850 

AACCACCCTG GTCAGATCGG TGCCGGCTAC GCTCCAGTCT TGGATTGCCA 900 

2 0 CACTGCCCAC ATTGCTTGCA AGTTCGCCGA GTTGCTCGAG AAGATCGACC 950 

GTCGTACCGG AAAGTCCATC GAGAACAACC CCAAGTTCAT CAAGTCTGGT 1000 

GATGCTGCCA TCGTCAAGAT GATTCCCAGC AAGCCCATGT GTGTCGAGGC 1050 

TTTCACCGAC TATCCTCCTC TGGGTCGTTT CGCTG 10 85 
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2) INFORMATION FOR SEQ ID NO: 458 



(i) SEQUENCE CHARACTERISTICS : 
3 0 (A) LENGTH: 4 92 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

3 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: DAL- 95 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458 



TGTCTTCATC CGGAATTGAT TGTGAGTCGT TCCACATGCT CACCTAGTTT 5 0 

TCGCTCGATC TTTTCACTAA CGCAAACCAT GTAGAACAAC ATTGCCAAGG 100 

4 5 CCCACGGTGG TTACTCCGTC TTCACTGGTG TTGGTGAGCG TACT CGTGAG 150 

GGTAACGATC TGTACCACGA AATGCAGGAG ACTGGTGTCA TTCAGCTCGA 200 

GGGTGAATCC AAGGTCGCAC TGGTGTTCGG ACAGATGAAC GAGCCCCCCG 250 

GTGCCCGTGC CCGTGTCGCC CTTACCGGTC TGACCATTGC CGAGTACTTC 300 

CGTGACGAGG AGGGTCAGGA CGTGCTGCTC TTCATTGACA ACATTTTC CG 350 

50 TTTCACCCAG GCCGGTTCTG AGGTGTCTGC CCTTCTCGGT CGTATCCCCT 4 00 

CTGCCGTCGG TTACCAGCCC ACCCTGGCCG TCGACATGGG TGGTATGCAG 450 

GAGCGTATCA CCACCACCAA GAAGGGTTCT ATTACCTCCG TC 4 92 



2) INFORMATION FOR SEQ ID NO: 459 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1154 bases 
60 (B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastoschizomyces capitatus 

(B) STRAIN: ATCC 10663 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459 

GTCCGTGGTC AAGAAGTTAT TGACAC TGGT GCCCCAATTA CCATTCCTGT 50 

TGGT CGTGGT ACTCTTGGTA GAATTATCAA CGTCATTGGT GAACCAATTG 100 

ACGAACGTGG TCCTATCAAG GCTTCTAAGT ATGCTCCTAT CCATACTGAA 150 

15 CCACCAACCT TTGCTGAACA ATCTACTTCT GCTGAAGTTC TTGAAACTGG 2 00 

TATCAAGGTT GTCGATCTTC TTGCTCCTTA CGCCCGTGGT GGTAAGATTG 2 50 

GTCTTTTCGG TGGTGCTGGT GTCGGTAAGA CTGTCTTCAT TCAAGAACTT 3 00 

ATTAACAACA TTGCCAAGGC TCACGGTGGT TTCTCTGTCT TCACTGGTGT 3 50 

CGGTGAAAGA AC C CGTGAAG GTAACGATCT TTACCGTGAA ATGAAGGAAA 4 00 

2 0 CTGGTGTCAT CAACCTCGAA GGTGACTCCA AGGTCGCTCT CGTTTTCGGT 4 50 

CAAATGAACG AACCTCCAGG TGCCCGTGCC CGTGTCGCTT TGACTGGTCT 5 00 

TACCATTGCC GAATACTTCC GTGATGAAGA AGGACAAGAT GTCTTGCTTT 550 

TCGTTGACAA CATTTTCAGA TTCACCCAAG CCGGTTCTGA AGTCTCTGCT 6 00 

CTTTTGGGTC GTATTCCATC TGCCGTCGGT TACCAACCTA CCCTTGCTAC 650 

2 5 CGATATGGGT GCCCTCCAAG AACGTATTAC CACCACCCAA AAGGGTTCCG 7 00 

TCACATCTGT CCAAGCCGTC TATGTCCCAG CAGACGATTT GACTGATCCT 7 50 
GCCCCAGCCA CCACTTTCGC TCACTTGGAC GCCACCACTG TCTTGTCTCG 8 00 

TTCCATTTCC GAATTGGGTA TCTACCCAGC TGTCGATCCT CTCGATTCCA 850 
AGTCTCGTCT TTTGGATCCT GAAGTTATTG GACACGAACA CTACGAAGTT 90 0 

3 0 GCCACTCAAG TTCAACAAAC CCTCCAAGCT TACAAGTCTC TC CAAGAT AT 950 

CATTGC CATT TTGGGTATGG ATGAATTGTC TGAAGC TGAT AAGCTTACTG 100 0 

TCGAACGTGC CCGTAAGATC CAAAGATTCC TTTCCCAACC ATTCGCTGTT 105 0 

GCCGAAGTTT TCACTGGTAT CGAAGGTCGT CTCGTTCCAT TGAAGGAAAC 110 0 

CGTCAGATCT TTCAAGGAAA TCCTTGAAGG TAAGTACGAT CACCTTCCAG 115 0 

35 AAGC 1154 



40 



2) INFORMATION FOR SEQ ID NO: 460 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1295 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
4 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 18804 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460 

55 CCAATTCGAC GAAGGAAACT TGCCAGCTAT TTTGAATGCT TTGACTTTGA 5 0 

AGAACGGTGA CCAAGACTTG GTCTTGGAAG TTGCTCAACA TTTGGGTGAA 10 0 

AACACCGTCA GAGCTATTGC TATGGATGGT ACTGAAGGTT TAGTCAGAGG 150 

TACCGAAGTC AACGATACCG GTGCCCCAAT CTCCGTCCCA GTCGGTAGAG 200 

GTACTTTAGG TAGAATCATC AATGTTGTTG GTGAACCAAT TGATGACAGA 2 50 

60 GGTCCAATTG AATGTAAGGA AAA G AAA CCA ATTCACGCTG AACCACCATC 3 00 



411 



CA 022834S8 1999-09-28 



ATTCGTTGAA CAATCCACTG CTGCCGARAT TTTGGAAACC GGTATCAAGG 3 50 

TTGTCGACTT GTTGGCCCCA TACGCCAGAG GTGGTAARAT TGGTTTATTC 400 
GGTGGTGCTG GTGTCGGTAA GACCGTCTTT AT CCAAGAAT TGATTAACAA 450 
CATTGCCAAA GCCCATGGTG GTTTCTCTGT CTTTACCGGT GTYGGTGAAA 50 0 

5 GAACCAGAGA AGGTAACGAT TTGTACCGTG AAATGAAAGA AACCGGTGTC 550 
ATCAACTTGG AAGGTGACTC CAAGGTCGCT TTGGTCTTCG GTCAAATGAA 60 0 

CGAACCACCA GGTGCTAGAG CTAGAGTTGC TTTGACTGGT TTGACCATTG 65 0 

CTGAATACTT CAGAGATGAA GAAGGTCAAG ATGTCTTGTT GTTCATTGAT 70 0 

AACATTTTCA GATTCACCCA AGCTGGTTCC GAAGTGTCTG CTTTGTTAGG 75 0 

10 TCGTATTCCA TCTGCTGTCG GTTATCAACC AACCTTAGCC ACTGATATGG 800 
GTCTTTTGCA AGAACGTATT ACCACCACCA AGAAAGGTTC CGTCACCTCT 85 0 

GTCCAAGCTG TCTATGTCCC AG CTGATGAT TTGACCGATC CTGCTCCAGC 90 0 

CACTACATTC GCCCATTTGG ATGCCACTAC TGTCTTGTCT AGAGGTATTT 95 0 

CTGAATTGGG TATCTACCCA GCTGTCGATC CATTGGATTC CAAATCCAGA 1000 

15 TTATTGGACG CTTCTGTTGT TGGTCAAGAA CATTACGATG TCGCTACTGG 1050 

TGTTCAACAA ACTTTACAAG CTTACAAATC CTTACAAGAT ATCATTGCTA 110 0 

TTTTGGGTAT GGATGAATTG TCTGAAGCTG ATAAATTGAC TGTCGAAAGA 115 0 

GCCCGTAAGA TCCAAAGATT CTTGTCTCAA CCATTCGCTG TTG CTGAAGT 12 0 0 

TTTCACTGGT AT CCCAGGTA GATTAGTCAG ATTGCAAGAC ACTGTCAAAT 12 50 

20 CATTCAAGGA TGTTTTGGAA GGTAAATACG ATAACTTGCC AGAAA 12 95 



2) INFORMATION FOR SEQ ID NO: 461 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1277 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

3 5 (A) ORGANISM: Candida, dubliniensis 

(B) STRAIN: NCPF 3 94 9 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 461 

4 0 TAACTTGCCA GCTATTTTGA ATGCTTTGAC TTTGAAGAAC GGTGACCAAG 50 

ATTTAGTTTT GGAAGTTGCT CAACATTTGG GTGAAAACAC CGTCAGAGCT 100 

ATTGCTATGG ATGGTACTGA AGGTTTAGTC AGAGGTACTG AAGTCAACGA 15 0 

TACCGGTGCC CCAATCTCCG TTCCAGTCGG TAGAGGTACC TTAGGTAGAA 2 00 

TCATCAATGT TGTTGGTGAA CCAATTGATG ACAGAGGTCC AATTGAATGT 250 

45 AAGGAAAAGA AACCAATTCA TGCAGAACCA CCATCCTTCG TTGAGCAATC 3 00 

CACTGCTGCC GAAATTTTGG AAACCGGTAT CAAGGTTGTC GACTTATTGG 35 0 

CCCCATACGC CAGAGGTGGT AAGATTGGTT TGTTCGGTGG TGCTGGTGTC 4 00 

GGTAAGACCG TCTTTATCCA AGAATTGATT AACAACATTG CTAAAGCCCA 4 50 

TGGTGGTTTC TCCGTCTTTA CCGGTGTCGG TGAAAGAACC AGAGAAGGTA 500 

5 0 ACGATTTGTA CCGTGAAATG AAAGAAACCG GTGTCATCAA CTTAGAAGGT 550 

GACTCCAAGG TCGCTTTGGT CTTTGGACAA ATGAACGAAC CACCAGGTGC 600 

TAGAGCTAGA GTTGCTTTGA CTGGTTTGAC TATTGCTGAA TACTTCAGAG 65 0 

ATGAAGAAGG TCAAGATGTC TTGTTGTTCA TCGATAACAT TTTCAGATTC 70 0 

ACCCAAGCTG GTTCCGAAGT GTCTGCTTTG TTAGGTCGTA TTCCATCTGC 7 50 

55 CGTCGGTTAT CAACCAACCT TAGCTACTGA TATGGGTCTT TTGCAAGAAC 8 00 

GTATTACCAC CACCAAGAAA GGTTCCGTCA CCTCTGTCCA AGCTGTCTAT 850 

GTCCCAGCTG ATGATTTGAC CGATCCTGCT CCAGCCACCA CATTCGCCCA 90 0 

TTTGGATGCC ACTACTGTCT TGTCTAGAGG TATTTCTGAA TTGGGTATTT 950 

ACCCAGCTGT CGATCCATTG GATTCCAAAT CCAGATTATT GGACGCTGCC 10 00 

6 0 GTTGTTGGTC AAGAACATTA TGATGTCGCT ACTGGTGTTC AACAAACTTT 1050 
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GCAAGCTTAC AAATCCTTAC AAG AT AT CAT 
AATTGTCTGA AGCTGATAAA TTGACTGTCG 
AGATTCTTGT CTCAACCATT CGCCGTTGCT 
AGGTAGATTA GTCAGATTGC AAGAC AC TGT 
5 TGGAAGGTAA ATACGATCAC TTGCCAG 



TGCTATTTTG GGTATGGATG 1100 
AAAGAGC CCG TAAGATT CAA 1150 
GAAGTTTTCA CTGGTATTCC 12 00 

CAAATCATTC AAGGATGTTT 12 50 

1277 



2) INFORMATION FOR SEQ ID NO: 462 



10 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1278 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Candida famata 

(B) STRAIN: ATCC 62894 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462 

25 GTAACTTGCC AGCTATTTTG AACGCTTTGA CCTTGAAGAA CGGTGAAAAC 50 

GAC TTAGTTT TAGAAGTTGC CCAACATTTG GGTGAAAACA CCGTCAGAGC 100 

TATTGCTATG GATGGTACTG AAGGTTTAGT TAGAGGTACT CCAGTTACCG 150 

ATTCTGGTGC TCCAATTTCT GTCCCAGTCG GTAGAGGTAC TTTAGGTAGA 2 00 

ATCTTGAACG TTATTGGTGA ACCAATTGAT GAACAAGGTC CAGTTGATGC 250 

30 CAAGGAAACC AGAC CAATTC ACCAAGACCC ACCAGCATTC GTTGATCAAT 3 00 

CCACCAAGGC TGAAGTTTTG GAAACTGGTA TCAAGGTTGT CGATTTATTA 350 

GCCCCTTACG CTAGAGGTGG TAAGATTGGT TTATTCGGTG GTGCCGGTGT 4 00 

CGGTAAGACC GTCTTTATCC AAGAATTGAT TAACAACATT GCCAAGGCCC 450 
ATGGTGGTTT CTCTGTTTTC ACTGGTGTCG GTGAAAGAAC CAGAGAAGGT 500 

3 5 AACGATTTAT ATAGAGAAAT GAAGGAAACT GGTGTCATTA ACTTGGAAGG 550 

TGACTCCAAG GTCGCCTTGG TTTTCGGTCA AATGAACGAA CCACCAGGTG 600 
CTAGAG CTAG AGT TGCTTTA AC CGGTTTAA CCATTGCCGA ATACTTCAGA 650 
GAC GAAGAAG GTCAAGATGT GTTATTGTTC GTCGATAACA TTTTTAGATT 700 
CACCCAAGCC GGTTCCGAAG TGTCTGCTTT GTTAGGTCGT ATTCCATCGG 750 

4 0 CTGTCGGTTA TCAACCAACC TTAGCCACTG ATATGGGTCT TTTACAAGAA 800 

AGAATTACCA CCACCACCAA GGGTTCCGTT ACTTCTGTCC AAGCTGTCTA 850 
CGTCCCAGCC GATGATTTAA CCGATCCTGC TCCAGCTACC ACTTTCGCCC 900 
ACTTGGATGC TACCACTGTG TTGTCTCGTG GTATCTCTGA ATTGGGTATT 950 

TACCCAGCTG TCGATCCATT GGATTCCAAA TC CAGATTGT TAGATGCTGA 100 0 

4 5 TATCGTTGGT AAAGAACACT ACGAAGTTGC CACTGGTGTC CAACAAACCT 1050 

TACAAGCTTA CAAATCTTTA CAAGATATCA TTGCTATTTT AGGTATGGAT 1100 

GAATTGTCTG AAGCCGATAA ATTGACTGTC GAAAGAGCCA GAAAGATCCA 1150 

AAGATTCTTG TCTCAACCAT TCGCCGTTGC CGAAGTTTTC ACCGGTATCC 12 0 0 

CAGGTAGATT AGTTAGATTG CAAGACACTG TTAAATCTTT CAAGGAAGTC 1250 

50 TTAGAAGGTA AATATGATCA CTTACCAG 127 8 



2) INFORMATION FOR SEQ ID NO: 463 

55 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1154 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
6 0 <D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Candida glabrata 

(B) STRAIN: ATCC 66032 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463 



10 TCTGGTCAGA GGC GAGAAGG TCGTCGACAC AGGTGCCCCA ATCTCCGTCC 5 0 

CTGTCGGCAG AGAGACCCTG GGCAGAATCA TCAACGTTAT CGGTGAACCT 100 

ATCGACGAGC GTGGCCCAAT CAACTCAAAG TTGAGAAAGC CTATCCACGC 150 

CGACCCTCCT TCCTTCGCAG AACAGTCCAC CGCCGCCGAA GTCCTGGAAA 200 

CAGGTATCAA GGTCGTCGAC TTGCTGGCCC CTTACGCCAG AGGTGGTAAG 2 50 

15 ATCGGTCTGT TCGGTGGTGC CGGTGTCGGT AAGACCGTGT TCATCCAAGA 3 00 

ACTGATCAAC AACATCGCAA AGGCTCACGG TGGTTTCTCC GTGTT CAC AG 3 50 

GTGTCGGTGA AAGAACCAGA GAAGGTAACG ATTTGTACAG AGAAATGAAG 4 00 

GAAACCGGTG TCATCAACTT GGAAGGTGAC TCTAAGGTCG CCTTGGTCTT 450 

CGGCCAAATG AACGAACCAC CAGGAGC CAG AGCCAGAGTC GCCTTGACCG 500 

20 GTTTGACCAT CGCAGAATAC TTCAGAGATG AAGAAGGTCA AGATGTCCTG 550 

CTGTTCGTCG ACAACATTTT CAGATTCACC CAAGCCGGTT CAGAAGTCTC 600 

CGCTTTGCTA GGTCGTATCC CATCCGCCGT CGGTTATCAA CCAACCTTGG 650 

CCACCGATAT GGGTCTGTTG CAAGAAAGAA TTACCACCAC AAAGAAGGGT 70 0 

TCCGTCACTT CCGTCCAAGC CGTCTACGTG CCTGCAGATG ATTTAACAGA 750 

2 5 TCCTGCCCCT GCCACTACTT TCGCGCACTT GGACGCCACC ACCGTCTTGT 800 

CCAGAAGTAT CTCAGAATTG GGTATCTACC CAGCTGTCGA CCCATTGGAC 850 

TCCAAGTCTA GATTG CTAGA CGCTGCCGTT GTCGGTGAAG AGCATTACAA 900 

CGTCGCCACA AAGGT CCAAG AAACTTTACA AACTTACAAG TCTCTGCAAG 950 

ATATCATCGC CATTTTGGGT ATGGATGAAT TGTCCGAACA AGATAAGCTA 100 0 

3 0 ACTGTCGAAA GAGCAAGAAA GATCCAAAGA TTCTTGTCCC AACCATTCGC 105 0 

TGTCGCTGAA GTTTTCACCG GTATCGAAGG TAAGCTGGTC AGATTGAAGG 110 0 

ACACCATCTC CTCTTTCAAG GCTGTCTTGG AAGGTAAGTA CGATGATCTT 115 0 

CCAG 1154 

35 



2) INFORMATION FOR SEQ ID NO: 464 

(i ) SEQUENCE CHARACTERISTICS : 
4 0 (A) LENGTH: 12 93 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



4 5 (ii) MOLECULE TYPE: Genomic DNA . 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida guilliermondii 

(B) STRAIN: ATCC 6260 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 64 



CCACTACGAG GACGGTAACC TTCCTGCTAT TTTCAACGCC TTGACTCTTA 50 

AGAACGGTGA CCAAAACTTG GTTTTGGAAG TTGCCCAGCA TTTGGGTGAA 100 

55 AACACCGTCA GAACCATTGC TATGGATGGT ACTGAAGGTT TGGTTAGAGG 150 

TGCCAGCGTC AC TGACACTG GTGCTCCTAT CTCTGTGCCT GTTGGTCGTG 2 00 

GTACTTTGGG TCGTATCATC AACGTTATTG GTGAGCCAAT TGACGAGCGT 2 50 

GGACCAATCG AGTCCAAGCA AAAGAAGCCC ATTCACGCTG AACCACCATC 3 00 

GTTCGTCGAA CAATCCACTT CTGCCGAGGT TTTGGAAACC GGTATCAAGG 350 

60 TTGTCGACTT GTTGGCTCCA TACGC CAGAG GTGGTAAGAT TGGATTGTTC 4 00 
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GGTGGTGCCG 
CATTGCCAAG 
GAACCAGAGA 
ATCAACTTGG 
5 CGAACCTCCA 
CTGAATACTT 
AACATTTTCA 
TCGTATTCCT 
GTTTGTTGCA 

10 GTC CAAGCTG 
TACTACTTTT 
CCGAGTTGGG 
TTGTTGGATG 
CGTTCAACAA 

15 TTTTGGGTAT 
GCCCGTAAGA 
TTTCACTGGT 
CTTTCAAGGA 



GTGTCGGTAA 
GCTCACGGTG 
AGGTAACGAT 
AAGGTGAATC 
GGAGCTAGAG 
CAGAGATGAG 
GATTCACTCA 
TCGGCTGTCG 
AGAG CGTATT 
TCTATGTGCC 
GCTCACTTGG 
TATTTACCCA 
CCTCTGTTGT 
ACCTTGCAAG 
GGACGAATTG 
TCGAGAGATT 
ATCAGTGGTA 
AGTCTTGGAA 



GACTGTGTTT 
GTTTCTCCGT 
TTGTACCGTG 
CAAGGTGGCC 
CCAGAGTTGC 
GAGGGTCAAG 
AGCTGGTTCT 
GTTACCAACC 
ACCACCACCA 
AGCCGATGAT 
ATGCTAC CAC 
GCTGTCGATC 
CGGTGAGGAG 
CTTACAAGTC 
TCGGAAGCTG 
CTTGTCTCAA 
AGTTGGTCAG 
GGTAAGTACG 



ATCCAAGAGT 
GTTCACCGGT 
AAATGAAGGA 
TTGGTGTTCG 
CCTTACCGGT 
ATGTGTTGTT 
GAAGTGTCGG 
TACTTTGGCC 
AGAAGGGTTC 
TTGACCGATC 
TGTGTTGTCT 
CTTTGGATTC 
CACTACTCGG 
TTTGCAAGAT 
ACAAGTTGAC 
CCATTTGCTG 
ATTGGAGGAC 
ATCACTTGCC 



TGATTAACAA 4 50 

GTCGGTGAAA 50 0 

AACTGGTGTC 550 

GT CAAATGAA 600 

TTGACCATCG 650 

GTTCGTCGAC 70 0 

CTTTGTTGGG 750 

AC CGATATGG 800 

CGTCACCTCT 85 0 

CTGCTCCTGC 900 

AGAGGT AT CT 95 0 

CAAGTCGAGA 1000 

TTGCTTCTAA 1050 

ATCATTGCCA 1100 

CGTCGAGAGA 1150 

TTGCCGAAGT 1200 

ACTATCAGAT 1250 

AGA 12 93 



20 



25 



30 
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2) INFORMATION FOR SEQ ID NO: 465 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 12 93 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida haemulonll 

(B) STRAIN: ATCC 22991 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465 



CACTTTGACG 
GAACGGTGAC 

4 0 ACACCGTCAG 
GAGGCTGTCA 
GACTTTGGGT 
GACCAATCAA 
TTCGTTGAGC 

45 TGTCGACTTG 
GTGGTGCCGG 
ATTGCCAAGG 
AACCAGAGAA 
TCAACTTCGA 

50 GAGCCACCAG 
CGAATACTTC 
ACATTTTCAG 
CGTATT C CAT 
TACCTTGCAA 

55 TCCAGGCCGT 
ACCACTTTCG 
CGAGTTGGGT 
TGTTGGATGC 
GTC CAGCAGA 

60 TTTGGGTATG 



ATGGTAACTT 
CAGGACTTGG 
AACCATTGCC 
CTGACACTGG 
CGTATCATCA 
GTCCAAGAAG 
AGTCTACTTC 
TTGGCCCCTT 
TGTCGGTAAG 
* CCCACGGTGG 
GGTAACGATT 
GGGTGACTCC 
GAGC TAGAGC 
AGAGATGAAG 
ATTCACTCAG 
CTGCTGTCGG 
GAAAGAATTA 
TTACGTGCCA 
CTCACTTGGA 
ATCTACCCAG 
CTCTGTTGTC 
CCTTGCAGGC 
GATGAATTGT 



GCCAGCCATT 
TCTTGGAGGT 
ATGGACGGTA 
TGCTCCAATC 
ACGTTATTGG 
AGAAACCCAA 
TGCTGAGGTT 
ACGCCAGAGG 
ACCGTCTTTA 
TTTCTCTGTC 
TGTAC CGTGA 
AAGGTCGCCT 
TAGAGTTGCT 
AAGGT CAGGA 
GCTGGTTCCG 
TTACCAGCCA 
CCACCACCAA 
GCTGATGATT 
TGCTACCACT 
CTGTCGACCC 
GGTAAGGAGC 
CTACAAGTCC 
CCGAAGCCGA 



TTCAACGCCT 
CGCCCAGCAC 
CCGATGGTTT 
TCCGTGCCTG 
TGAGC CAATT 
TCCACACTGA 
TTGGAGACTG 
TGGTAAGATT 
TCCAAGAGTT 
TTTACCGGTG 
AATGCAGGAG 
TGGTCTTCGG 
TTGAC CGGTT 
TGTGTTGTTG 
AGGTGTCCGC 
ACCTTGGCCA 
GAAGGGTTCC 
TGACCGATCC 
GTGTTGTCTA 
ATTGGACTCC 
ACTACGATGT 
TTGCAGGATA 
CAAGTTGACT 



TGAAGTTGAA 5 0 

TTGGGTGAGA 10 0 

GGTCAGAGGC 15 0 

TTGGTCGTGA 200 

GACGAGAGAG 250 

CCCACCAACC 3 0 0 

GTATTAAGGT 3 50 

GGTTTGTTCG 4 0 0 

GATTAACAAC 450 

TCGGTGAGAG 50 0 

ACCGGTGTCA 550 

TCAGATGAAC 60 0 

TGACCATTGC 650 

TTCGTTGACA 700 

CTTGTTGGGT 750 

CCGATATGGG 800 

GTCACCTCTG 850 

TGCCCCAGCT 90 0 

GAGGTATTTC 950 

AAGTCTAGAT 1000 

TGCTACCAAC 1050 

TCATTGCCAT 1100 

GTCGAGAGAG 1150 
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CTAGAAAGAT TCAGAGATTC TTGTCCCAGC CATTCGCCGT CGCCGAGGTT 12 00 

TTCACTGGTA TCGAGGGTAA GTTGGTCAGA TTGGAGGAGA CCGTCAAGTC 1250 
TTTCAAGGAG GTCTTGGATG GTAAGTACGA CCACTTGCCA GAG 12 93 



2) INFORMATION FOR SEQ ID NO: 466 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 1111 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida inconspicua 

(B) STRAIN: ATCC 16783 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 66 



AACGGTGGAT C TAAGTTAGT CTTAGAAGTT GCTCAACATT TGGGTGAAAA 50 
CACTGTCAGA ACCATTGCTA TGGATGGTAC TGAAGGTTTA GTTAGAGGTC 100 

2 5 AACCAGTTAA TGATACTGGT GCTCCAATCT CTGTCCCAGT CGGTAGAGGT 150 

ACTTTAGGTA GAATCTTAAA CGTTATTGGT GATCCAGTCG ATGAAAGAGG 2 00 

TCCAATTGAT TGTAAGGAAA GAAAACCAAT TCATCAAGAT CCTCCTGCTT 250 
TCGTTGAACA ATCTACTGAA GCTGAAGTTT TAGAAACCGG TATTAAGGTT 3 00 

GTTGATTTAT TAGCTCCTTA CGCTAGAGGT GGTAAGATTG GTTTATTCGG 350 

3 0 TGGTGCTGGT GTCGGTAAAA CCGTTTTCAT TCAAGAATTA ATTAACAATG 400 

TTGCAAAGGC TCATGGTGGT TTCTCAGTTT TCACTGGTGT CGGTGAAAGA 450 
ACCAGAGAAG GTAATGATTT ATACAGAGAA ATGAAGGAAA CTGGTGTTAT 50 0 

TAACTTAGAA GGTGAATCTA AGGTCGCCTT AGTTTTCGGT CAAATGAATG 55 0 

AACCACCAGG AGCAAGAGCA AGAGTTGCTT TAACCGGTTT AACTATTGCT 600 

3 5 GAATACTTCA GAGATGAAGA AGGTCAAGAT GTCTTGTTAT TCATTGATAA 65 0 

CATTTTCAGA TTTACTCAAG CAGGTTCTGA AGTTTCTGCA TTGTTAGGTA 700 
GAATTCCATC CGCTGTCGGT TATCAACCAA CTTTAGCAAC TGATATGGGT 75 0 

CTTTTACAAG AAAGAATTAC AACTACTAAG AAAGGTTCCG TTACTTCTGT 80 0 

CCAAGCAGTT TATGTCCCAG CAGATGATTT AACTGATCCT GCTCCAGCAA 850 

4 0 CTACTTT CGC C C AC TTAGAT GCAACTACTG TCTTATCTAG AGGTATTTCC 90 0 

GAATTAGGTA TTTACCCAGC TGTCGATCCA TTAGATT CTA AATCTAGATT 950 

ATTAGATGTT GCTGTTGTTG GTCAAGAACA TTATGATGTT GCAACTCAAG 100 0 

TTCAAGAAAC TTTACAAGCA TACAAATCTT TACAAGATAT TATTGCTATT 1050 

TTAGGTATGG ATGAATTATC TGAACAAGAT AAATTAACCG TTGAAAGAGC 110 0 

45 AAGAAAGATT C 1111 



50 



2) INFORMATION FOR SEQ ID NO: 467 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 83 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
6 0 (A) ORGANISM: Candida kefyr 
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(B) STRAIN: ATCC 28838 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467 

5 TCGAACAAGG TCAATTGCCA GAAATTTTGA ACGCTTTGGA GATTGAWACT 50 

CCTCAAGGTA AGTTGGTTTT GGAAGTTGCC CAACATTTGG GTGAAAACAC 100 

CGTCAGAACC ATTGCTATGG ACGGTACCGA AGGTTTGGTC CGTGGTGAGA 150 

AGGTTTTGGA CACTGGTGCT CCAATTTCCG TCCCAGTCGG TAGAGAAACT 200 

TTGGGTAGAA TCATCAACGT TATTGGTGAG CCAATTGACG AAAGAGGCCC 250 

10 H?2SgTCC AAGATGAGAA AGCCAATTCA CGCTGACCCT CCATCCTTTG 300 

TTGAACAATC CACTGCTGCT GAAGTTTTGG AAACCGGTAT CAAGGTTGTC 350 

GACTTGTTGG CCCCATACGC CAGAGGTGGT AAGATTGGTT TGTTCGGTGG 4 00 

TGCCGGTGTC GGTAAGACCG TTTTCATCCA AGAGTTGATT AACAACATTG 4 50 

CCAAGGCCCA TGGTGGTTTC TCCGTCTTCA CCGGTGTCGG TGAAAGAACC 500 

15 AGAGAAGGTA ACGATTTGTA CCGTGAAATG AAGGAAACCG GTGTCATCAA 550 

CTTGGAAGGT GACTCCAAGG TCGCCTTGGT CTTCGGTCAA ATGAACGAAC 600 

CACCTGGAGC TAGAGCCAGA GTTGCCTTGA CCGGTTTGAC TATCGCTGAA 650 

TACTTCAGAG ATGAAGAAGG TCAAGATGTG TTGTTGTTTA TCGACAACAT 700 

TTTCAGATTC ACACAAGCCG GTTCCGAAGT GTCCGCTTTG TTGGGTCGTA 750 

2 0 TTCCATCCGC TGTCGGTTAC CAACCTACTT TGGCCACCGA TATGGGTTTG 800 

TTGCAAGAAA GAATTACCAC T AC CAAGAAG GGTTCCGTTA CCTCCGTCCA 850 

AGCTGTCTAC GTCCCTGCTG ATGATTTGAC TGATCCTGCT CCAGCTACYA 900 

CTTTCGCCCA TTTGGACGCC ACCACCGTGT TGTCCAGAGG TATCTCCGAA 950 

TTGGGTATCT ACCCAGCTGT CGATCCATTG GATTC CAAGT CTAGATTGTT 1000 

2 5 GGACGCTGCC GTTGTCGGTC AAGAACATTA CGACGTCGCT ACTCAAGTTC 1050 

AACAAACTTT GCAAGCTTAC AAGTCTTTGC AAGATATCAT TGCCATTTTG 1100 

GGTATGGATG AATTGTCTGA ACAAGACAAG TTGACTGTCG AAAGAGCCAG 1150 

AAAGATCCAA AGATTCTTGT CTCAACCATT CGCCGTCGCC GAAGTTTTCA 12 00 

CTGGTATCCC AGGTAGATTG GTCAGATTAA AGGACACCAT CGCTTCCTTC 1250 

3 0 AAGGCTGTTT TGGAAGGTAA GTACGATCAC TTG 12 83 



35 



2) INFORMATION FOR SEQ ID NO: 468 



{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1287 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Candida krusei 

(B) STRAIN: ATCC 34135 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468 

50 CTTCGAACAA GGCCAATTAC CACAAATTTT AAACGCTTTA GTTATGGATA 50 

ATGGTGGTAA CAAGTTAGTT TTAGAAGTTG CTCAACATTT AGGTGAAAAC 100 

ACTGTCAGAA CCATTGCTAT GGATGGTACT GAAGGTTTAG TTAGAGGTCA 150 

AAC CGTTAAC GATACCGGTG CTCCAATCTC TGTCCCAGTT GGTAGAGGTA 2 00 

CCTTAGGTAG AATCTTGAAC GTCATTGGTG ATCCAGTCGA TGAAAGAGGT 2 50 

55 CCAGTTGACT GTAAGGAAAG AAAGC CAATT CACGCTGATC CTCCAGCTTT , 3 00 

CGTTGAACAA TCCACTGAAG CTGAAGTTTT GGAAACTGGT ATTAAGGTTG 3 50 

TCGATTTATT AGCAC CTTAC GCAAGAGGTG GTAAGATTGG TTTATTCGGT 4 00 

GGTGCTGGTG TTGGTAAGAC CGTTTTTATC CAAGAATTGA TCAACAATGT 4 50 

YGCAAAGGCT CATGGTGGTT TCTCCGTTTT CACTGGTGTT GGTGAAAGAA 500 

60 CCAGAGAAGG TAACGATTTA TACAGAGAAA TGAAGGAAAC TGGTGTTATT 550 
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AACTTGGAAG GTGAATCCAA GGTCGCCTTA GTTTTCGGTC AAATGAACGA 
rCTAGAGCAA GAGTTGCTTT AACTGGTTTG ACCATTGCAG 

SSESSS agatgaagaa GGTCAAGATG tcttgttatt cattgataac 
mSSaSS ?cacccaagc aggttctgaa gtctctgcat tattaggtag 
S?cca?c? gctgtcggtt atcaaccaac tttagcaacc gatatgggtc 

AAGAATTACC ACCACCAAGA AGGGTTCCGT TACTTCTGTC 

?II?c5gttt atgtcccagc cgatgattta accgatcctg ctccagctac 
™Scgcc cacttggatg caaccactgt cttgtcyaga ggtatttccg 

ItSS^S CTACCCAGCT GTCGATCCAT TAGATTCTAA GTCTAGATTA 

ttagatgttg SSSSttgg ?caagaacat tatgaagttg caactcaagt 

IT^ltlrT TTACAAGCTT ACAAGTCTTT ACAAGATATT ATTGCTATTT 
?rS5?A^SS TGAATTATCT GAACAAGATA AGTTAACYGT TGAAAGAGCA 
I?SSSI?CC AAAGATTCTT ATCTCAACCA TTCTCTGTTG CAGAAGTTTT 

SSSSLot tagtcagatt agaagaaacc atcaagtctt 

TCAGGGATGT TCTTGCAGGT AAGTACGATC ACTTACC 



2) INFORMATION FOR SEQ ID NO: 4 69 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1283 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Candida lambica 

(B) STRAIN: ATCC 24750 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 469 

-rrraarAAGG TCAATTACCA CCAATTCTTA ACGCTTTGGT CATGGAAAAC 50 

S?g5??aaa agttJgtttt GGAAGTTGCT CAACATTTGG GTGAAAACAC 100 

?OTCAGAACC ATTGCTATGG ACGGTACCGA AGGTTTAGTT AGAGGTCAAC 150 

CTGTTAACGA CACTGGTGCT CCAATCTCTG TCCCAGTTGG TAGAGGTACT 200 

CTTGGTAGAA TCTTGAACGT CACTGGTGAC CCAGTYGATG AAAGAGGTCC 250 

TCTCGftMTGT AAGGAGAGAA GACCAATTCA CCAAGACCCA CCTGCTTTCG 3 00 

??SaCCAATC CACTGAAGCT GAAGTTTTGG AAACCGGTAT TAAGGTTGTC 3 50 

5I?TTATTAG CACCTTACGC TAGAGGTGGT AAGATTGGTT TGTTCGGTGG 400 

TGCTGGTGTC GGTAAGACCG TTTTCATCCA AGAATTGATT AACAATGTTG 4 50 

CTAAGGCCCA CGGTGGTTTC TCCGTTTTCA CTGGTGTCGG TGAAAGAACC 500 

4 5 AGAGAAGGTA ACGATTTATA CAGAGAAATG AAGGAAACCG GTGTTATTAA 55 0 

CCTTGAAGGT GAATCTAAGG TCGCCCTCGT TTTCGGTCAA ATGAACGAAC 600 

CACCAGGAGC AAGAGCTAGA GTTGCTCTTA CTGGTTTGAC CATTGCTGAA 650 

TACTTCAGAG ATGAAGAAGG TCAAGATGTC TTGCTTTTCA TTGACAACAT 700 

ACCCAAGCAG GTTCCGAAGT GTCTGCTTTG TTAGGTAGAA 750 

50 TTCCWTCTGC CGTTGGTTAC CAACCAACCT TAGCAACCGA TATGGGTTTG 80 0 

TTACAAGAAA GAATTACCAC CACCAAGAAG GGTTCCGTCA CCTCCGTCCA 850 

AGCTGTTTAC GTCCCAGCTG ATGATTTRAC TGACCCTGCA CCAGCAACCA 900 

CTTTCGCCCA CTTGGATGCT ACCACCGTCT TGTCTAGAGG TATTTCCGAA 950 

TTAGGTATTT ACCCAGCTGT CGATC CATTA GATTCTAAGT CTAGATTATT 1000 

^5 GGATGTCGCT GTTGTCGGTC AAGAACATTA TACCGTTGCA AC YCAAGTC C 1050 

AAGAAACTTT ACAAGCTTAC AAGTC CTTAC AAGATATCAT TGCTATTTTG 1100 

GGTATGGACG AATTATCTGA ACAAGATAAG CTTACTGTTG AAAGAGCAAG 1150 

AAAGATCCAA AGATTCCTTT CCCAACCTTT CTCCGTCGCA GAAGTTTTCA 1200 

CTGGTATCCC AGGTAAGCTT GTCAGATTAG AAGAAACCAT TATTTCTTTC 1250 

AGAGATGTCC TCGATGGTAA GTACGACCAC TTA 1283 



35 



40 



60 
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2) INFORMATION FOR SEQ ID NO: 470 

( i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi ) ORIGINAL SOURCE: 
15 (A) ORGANISM: Candida lusitaniae 

(B) STRAIN: ATCC 66035 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470 

SEES Sffi ^ £ 

200 
250 
300 

25 TTCGGTGGTG j.™ — ;^ TTTrArT GGTGTCGGTG 350 

400 
450 
500 

30 TTGCCGAGTA ^f™*^ J^"™— wnllmhl -TGCTTTGTT 600 

650 
700 
750 

35 AGCCACCACC TTL^li r^^A CTCCAAGTCT 850 

900 
950 
1000 
1050 

4C AGAGCCAGAA A^TL = ^^i™^ ^ttp^G GACACTGTCA 1100 

1140 



45 



asss sSs ssss sssss 

s IS lil ils IEE 

^SgACTA CTTCAGAGAC GAAGAGGGCC AAGATGTCTT GTTGTTCGTT 
2^£S?TT TCAGATTCAC CCAGGCCGGT TCTGAAGTGT CTGCTTTGTT 
SSSSS SSSSSS TCGGTTAC CA ACCAACCTTG ? CCACCGATA 
^nrr-nrrrTTT G CAAGAG AG A ATTACCACCA CCAAGAAGGG TTCCGT CACC 
?2?g?2SIg 2Stttatgt TCCAGCTGAT GACTTGACTG accctgctcc 
TSSSrAT? TTCGCCCACT TGGACGCCAC CACTGTGTTG TCCAGAGGTA 
tSSSSS ggg?a?Sa? CCAGCTGTCG ACCCATTGGA ctccaagtct 
ISStgttgg acgcttctat tgttggtaag gagcactacg aagttgcttc 
tS?ot??S caaactttgc aagcttacaa gtctttgcaa gatatcattg 
?S????gSg 5atggatgaa ttgtctgagg ctgacaagtt gaccgttgag 
aSSSgaa agatccaaag attcttgtct caaccattcg ctgttgccga 

S^SSSSS GGTATCCCAG GTAGATTGGT CAGATTGGAG GACACTGTCA 
GATCCTTCAA GGAAGTTTTG GACGGTAAGT AC GAC CACTT 



2) INFORMATION FOR SEQ ID NO: 471 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 129 6 bases 
50 (b) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida norvegensxs 

(B) STRAIN: ATCC 22 977 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471 
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SS EES SEE ^ 2S~§ | 

>. is III si III s | 

S5S2£££ 2SS5ES3 SEESSS aIItgaagga aactggtgtt 550 

GAACTAGAGA AGGTAAC^Ai ^rrrrGCA TTAGTTTTCG GTCAAATGAA 6 00 

SSSSSil | 

^SS?CA" tCCGCTGTCG GTTATCAACC AACTTTAGCA ACTGATATGG 800 

SSmSS AGAAAGAATT ACTACTACTA AGAAAGGTTC TGTTACTT CC 850 

S?caagcag TTTATGTCCC AGCAGATGAT TTAACTGATC CTGCTCCAGC 900 

™ ?I?SS??TTC GCCCACTTAG ATGCAACTAC TGTCTTATCT AGAGGTATTT 950 

20 ^SaSS? ?at5taccca GCTGTCGATC CATTAGATTC TAAATCTAGA 1000 

SSSSot tStCAAGAA CATTATGATG TTGCAACTCA 1050 

2J?5££2 actttacaag cttacaaatc tttacaagat attattgcta 1100 

™£gGTAT GGATGAATTA TCTGAACAAG ATAAATTAAC CGTTGAAAGA 1150 

g?Sgaaaaa ttcaaagatt cttatctcaa cctttctctg TCGCAGAAGT 1200 

TTTCACTGGT ATCCCAGGTA AGCTTGTTAG ATTAGAAGAA ACTATTTCTT 1250 

SSraSSX TGTCTTAGCA GGTAAGTACG ATCACTTACC AGAAAA 1296 



30 



~2) INFORMATION FOR SEQ ID NO: 472 



( i ) SEQUENCE CHARACTER I STI CS : 
(A) LENGTH: 12 97 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 

( vi ) ORI GINAL SOURCE : 

(A) ORGANISM: Candida parapsilosis 

(B) STRAIN: ATCC 90018 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472 

ACACTTCGAA GAAGGAAACT TGCCAGCTAT TTTGAACGCT TTGACCTTGA 50 

AAAACGGTAA ACAAGACTTG GTCTTGGAAG TTGCCCAACA TTTGGGTGAA 100 

AACACCGTCA GAGCAATTGC TATGGATGGT ACTGAAGGTT TAGTTAGAGG 150 

50 tOTTCTOTC ACTGACACTG GTGCCCCAAT TTCTGTCCCA GTTGGTAGAG 200 

GTACTTTGGG TAGAATCATC AATGTTACTG GTGACC CAAT TGATGAAAGA 250 

GGTCCAATTG AATGTAAGAA GAGAAACCCA ATCCACGCTG AACCACCTTC 3 00 

ATTCATTGAA CAATCCACTG CTGCTGAAGT TTTGGAAACC GGTATTAAGG 3 50 

TTGTCGACTT GTTGGCTCCA TACGCTAGAG GTGGTAAGAT TGGTTTGTTC 4 00 

55 GGTGGTGCCG GTGT CGGTAA AACCGTGTTT ATC CAAGAAT TGATCAACAA 4 50 

CATTGC CAAG GCACATGGTG GTTTCTCGGT TTTCACTGGT GTTGGTGAAA 500 

GAAC CAGAGA AGGTAACGAT TTGTACCGTG AAATGAAGGA AACTGGTGTT 550 

ATCAACTTGG AAGGTGACTC CAAGGTCGCT TTGGTTTTCG GTCAAATGAA 600 

CGAACCACCT GGAGCTAGAG CTAGAGTTGC TTTGACTGGT TTGACCATTG 650 

60 CTGAATACTT TAGAGATGAA GAAGGTCAAG ATGTCTTGTT GTTTATTGAC 700 
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GTCTTTTGCA AGAACGTATT TC CTGCTCC AGC 900 

ssssss lis ? seeks ™g g^ss sss 

~ S iii ss 

10 SSSSSS SSSSSS ££™ a™^ ^to™** 1250 

ca?5Sagga agtcttggaa ggtaagtacg ATAACTTGCC AGAAAAT 12 97 



20 



25 



50 
100 
150 
200 



15 2) INFORMATION FOR SEQ ID NO: 473 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1285 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida rugosa 

(B) STRAIN: ATCC 96275 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473 

(TAGTTTGGC AACGACCTCC CTGCCATTTT GAACGCCCTC ACTCTTCAGC 
SSmSSSg TAACAAGCTT GTTCTTGAGG TTGCCCAGCA TCTCGGTGAG 

SSacottcc gtaccattgc tatggacggt accgagggtt tggtgcgtgg 
« ™^§§tgtc cacgacaccg gacaccccat catgactccc gtcggtgacg 

SSSctgSI ACGTATTCTT AACGTCACCG GTGACCCTGT AGACGAGCGT 250 

StcccotS aSctgacaa gttccgcccc atccacgccg aggcccctgc 
SSSSSS c^Sgctacca gtgccgaggt tcttgagacc ggtatcaagg 

TTGTCGACTT GCTCGCTCCT TACGCCAAGG GTGGTAAGAT TGGTTTGTTC 

an Sgtggtgccg GTGTCGGCAA GACCGTCTTC ATCCAGGAGC tgattaacaa 

ScG??AAG GCCCACGGTG GTTACTCCGT GTTCACTGGT GTCGGTGAGC 500 

SaCTCGTGA GGGTAACGAT TTGTACAAGG AAATGATCGA GTCCGGTGTC 550 

AToScCTCG ATGGTGAGTC CAAGGTCGCC TTGGTGTT CG GTCAGATGAA 600 
cSScCCCCT GGAGCCCGTG CCCGTGTCGC CCTTACCGGT CTTACCATCG 

45 SSagtattt CCGTGATGAG GAGGGTAAGG ATGTCTTGTT GTTCGTTGAC 
AACATTTTCC GCTTCACTCA GGCCGGTTCT GAGGTGTCCG CCTTGCTTGG 
TCGTATTCCT TCCGCTGTCG GTTACCAGCC TACCCTGGCC ACCGATATGG 

rrrlcCTTck GGAGCGTATC ACCACCACCA AGAAGGGTTC CGTTACATCC 850 

SSSSSS TCTACGTCCC TGCCGATGAT TTGACTGATC CCGCCCCTGC 900 

50 CACCACCTTC GCCCATTTGG ATGCCACCAC TGTCTTGTCT CGTGCCATCT 950 

Sgagttggg TATCTACCCC GCTGTCGACC CTCTCGACTC CAAGTCCCGT 1000 

S?C??GACG CCGCTGTCGT TGGTCAGGAG CACTACGATA CTGCCACCTC 1050 

CGTTCAGCAG ACTTTGCAGG CTTACAAGTC TTTGCAGGAT ATCATTGCCA 1100 

??CTTgStAT GGATGAGTTG TCCGAGTCTG ACAAGCTCAC CGTCGAGCGT 1150 

55 gScgtaaga TCCAGCGTTT CCTCTCCCAG CCTTTCGCTG TTGCTGAGGT 1200 

CTTCACTGGT ATTCAGGGCC GTCTTGTTCC TCTCAAGGAC ACTGTCCGCT 1250 

CCTTCAAGGA GATTCTCGAA GGTAAGTACG ATGCT 1285 



300 
350 
400 
450 



650 
700 
750 
800 
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2) INFORMATION FOR SEQ ID NO: 474 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 1283 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida sphaerica 

(B) STRAIN: ATCC 2504 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 474 

TTGAACAAGG TCAATTGCCA GCTATTTTGA ACGCTTTGGA AATCGACACT 50 

CCAGAAGGAA AGTTGGTTTT GGAAGTCGCT CAACATTTGG GTGAAAACAC 100 

TGTCAGAACC ATTGCTATGG ATGGTACTGA AGGTTTAGTC CGTGGTGAAA 150 

2 0 ACGTTTCTGA CACTGGTGCT CCAATTTCCG TCC CAGTTGG TAGAGAAACC 2 00 

TTGGGTAGAA TTATCAACGT TATTGGTGAG CCAATTGACG AAAGAGGTCC 250 

AATCAACTCC AAGATGAGAA AGCCAATTCA TGCTGATCCT CCATTATTCG 3 00 

TTGAACAATC CACTGCTGCT GAAGTTTTGG AAACTGGTAT CAAGGTTGTC 350 

GACTTGTTGG CCCCATACGC CAGAGGTGGT AAGATTGGTT TGTTCGGTGG 400 

2 5 TGCCGGTGTC GGTAAGACCG TTTTCATCCA AGAATTGATT AACAACATTG 45 0 

CCAAGGCTCA TGGTGGTTTC TCTGTCTTCA CTGGTGTCGG TGAAAGAACC 50 0 

AGAGAAGGTA ACGATTTGTA CCGTGAAATG AAGGAAACTG GTGTTATCAA 55 0 

CTTGGAAGGT GATTCTAAGG TCGCGTTGGT TTTCGGTCAA ATGAACGAAC 600 

CTCCTGGAGC TAGAGCTAGA GTCGCCTTGA CTGGTTTGAC CATCGCTGAA 65 0 

3 0 TACTTCAGAG ATGAAGAAGG TCAAGATGTC TTGTTGTTTA TCGACAACAT 70 0 

TTTCAGATTC ACACAAGCCG GTTCCGAAGT GTCCGCTTTG TTGGGTCGTA 750 

TTCCATCCGC TGTCGGTTAT CAACCAACTT TGGCCACCGA TATGGGTTTG 80 0 

TTGCAAGAAA GAATTAC TAC CACCAAGAAG GGTTCCGTCA CTTCTGTCCA 85 0 

AGCCGTTTAC GTGCCAGCTG ATGATTTGAC TGATCCTGCT CCAGCTACCA 90 0 

35 CTTTCGCGCA TTTGGATGCC ACCACTGTGT TGTCCAGAGG TATCTCTGAA 95 0 

TTGGGTATCT AC CCAGCTGT CGATCCTTTG GATTCCAAAT CTAGATTGTT 100 0 

GGATGCTGCC GTCGTTGGTC AAGAACATTA CGATGTCGCT ACTCAAGTTC 105 0 

AACAAACTTT GCAAGCTTAC AAGTCTTTGC AAGATATCAT TGC CATTTTG 110 0 

GGTATGGATG AATTGTCCGA ACAAGATAAG TTGACCGTCG AAAGAGCTAG 115 0 

4 0 AAAGATTCAA AGATTCTTGT CTCAACCTTT CGCTGTCGCT GAAGTCTTCA 12 0 0 

CTGGTATCCC AGGTAGATTG GTCAGATTAA AGGACACCAT CTCTTCTTTC 125 0 

AAGGCTGTCT TGGACGGTAA GTACGATCAC TTG 128 3 



45 



55 



2) INFORMATION FOR SEQ ID NO: 475 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1290 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida tropicalis 

(B) STRAIN: ATCC 13803 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475 
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CGACGAAGGT 
GTGAC CAAGA 
GTCAGGGCTA 
5 TGTCACTGAT 
TGGGTAGAAT 
ATTGAATGTA 
TGAACAATCT 
ATTTGTTGGC 

10 GCTGGTGTCG 
CAAAGCTCAT 
GAGAAGGTAA 
TTGGAAGGTG 
ACCTGGTGCT 

15 ACTTCAGAGA 
TTCAGATTCA 
TCCATCTGCT 
TGCAAGAACG 
GCTGTTTATG 

2 0 CTTCGCTCAC 
TGGGTATTTA 
GATGCTACTG 
ACAAACTTTA 
GTATGGATGA 

2 5 AAGATTCAAA 
TGGTATCCCA 
AAGATGTCTT 



AACTTGCCAG 

CTTGGTTTTG 

TTGCTATGGA 

ACCGGTGCTC 

TATCAACGTT 

AGGAAAGAAA 

ACTGCTGCCG 

TCCTTACGCT 

GTAAAACCGT 

GGTGGTTTCT 

CGATTTGTAC 

ACT C CAAGGT 

AGAGCTAGAG 

TGAAGAAGGT 

CCCAAGCTGG 

GTCGGTTATC 

TATTACCACC 

TCCCAGCCGA 

TTGGATGCCA 

CCCAGCTGTC 

TTGTTGGTCA 

CAAGCTTACA 

ATTGTCCGAA 

GATTCTTGTC 

GGTAGATTGG 

GGAAGGTAAA 



CTATTTTGAA 

GAAGTTGCTC 

TGGTACCGAA 

CAATTTCTGT 

GTTGGTGAAC 

GCCAATTCAC 

AAATTTTGGA 

AGAGGTGGTA 

CTTCATCCAA 

CTGTCTTTAC 

CGTGAAATGA 

CGCCTTGGTC 

TTGCTTTGAC 

CAAGATGTCT 

TTCCGAAGTG 

AACCAACCTT 

AC C AAGAAAG 

TGATTTAACC 

CTACTGTCTT 

GATCCATTGG 

AGAACATTAT 

AGTCCTTGCA 

GCCGATAAAT 

CCAACCATTC 

TTAGATTACA 

TACGATCACT 



TGCTTTGACT 

AACATTTGGG 

GGTTTAGTCA 

CCCAGTTGGT 

CAATTGATGA 

GCTGAACCAC 

AACCGGTATT 

AGATTGGTTT 

GAATTGATTA 

CGGTGTCGGT 

AAGAAACTGG 

TTCGGTCAAA 

TGGTTTGACC 

TGTTGTTCAT 

TCTGCTTTGT 

GGCCACTGAT 

GTTCTGTTAC 

GATCCAGCTC 

GTCTAGAGGT 

ATTCTAAATC 

GATGTTGCCA 

AGATATCATT 

TGACTGTCGA 

GCTGTTGCCG 

AGATACTGTT 

TGCCAGAAAA 



TTGAAGAACG 5 0 

TGAAAACACC 100 

GAGGTACTGC 150 

AGAGGTACCT 2 0 0 

CAGAGGTCCA 2 50 

CTTCATTCGT 3 0 0 

AAGGTTGTCG 3 50 

GTTCGGTGGT 4 00 

ACAACATTGC 4 50 

GAAAGAACCA 5 00 

TGTCATCAAC 550 
TGAACGAACC 600 
ATTGCTGAAT 650 
TGACAACATT 700 
TGGGT CGTAT 750 
ATGGGTCTTT 8 00 

TTCTGTCCAA 850 
CAGCTACCAC 900 
ATTTCTGAAT 950 

CAGATTATTG 1000 

CTGGTGTTCA 1050 

GCTATTTTGG 1100 

AAGAGCTCGT 1150 

AAGTTTTCAC 1200 

AAATCTTTCA 1250 
1290 



30 

2) INFORMATION FOR SEQ ID .NO: 476 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 12 67 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

40 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida utllis 

(B) STRAIN: Csp 3 88 



45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476 

TTTGAACGCC CTTGAGATCA AGAACCACGA GTCTGGAAGA TTGGTCCTCG 50 

AGGTTGCCCA GCACTTGGGT GAGAACACTG TCAGAACTAT TGCCATGGAT 100 

GGTACCGAAG GTCTCGTCCG TGGTGAGTCT GTTGTTGACA CCGGTTCTCC 150 

50 AATTACTGTC CCAGTTGGTC GTGAGACCTT GGGTCGTATC ATCAACGTTA 200 

TTGGTGAGCC AATTGATGAG CGTGGTCCAA TCAACACCAA GCACAGAAAC 2 50 

CCAATTCACG CTGAGCCACC ATCCTTCGCT GAGCAGTCTA CTGCTGCTGA 3 00 

GGTTTTGGAG ACTGGTATCA AGGTTGTCGA CCTTCTCGCC CCATACGCCA 3 50 

GAGGTGGTAA GATTGGTCTC TTTGGTGGTG CCGGTGTCGG TAAGAC CGTT 40 0 

55 TTCATCCAGG AGTTGATTAA CAACATTGCC AAGGCCCACG GTGGTTTCTC 450 

TGTTTTCACC GGTGTCGGTG AGAGAACCAG AGAGGGTAAC GATTTGTACC 50 0 

GTGAGATGAA GGAGACTGGT GTCATCAATC TCGAGGGTGA GTCCAAGGTT 55 0 

GCTCTCGTCT TCGGTCAGAT GAACGAGC CA CCAGGTGCCC GTGCCCGTGT 600 

CGCTTTGACC GGTTTGAC CA TTGCTGAGTA CTTCAGAGAT GAGGAGGGTC 650 

6 0 AGGATGTCTT GTTGTTTATT GACAACATTT TCAGATTCAC CCAGGCCGGT 700 
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TCCGAGGTGT CTGCCTTGCT TGGTCGTATT 
GCCAACTTTG GCCACCGATA TGGGTTTGTT 
CCCAGAAGGG TTCCGTCACT TCTGTCCAGG 
GATTTGACTG ATCCTGCTCC AGCCACCACT 
5 CACTGTGTTG TCCCGTGGTA TCTCTGAGTT 
ACCCATTGGA CTCCAAGTCC AGATTGTTGG 
GAGCACTACA ACACCGCCAC CGATGT C C AG 
GTCTCTCCAG GATATCATTG CTATTTTGGG 
CTGACAAGTT GACTGTCGAG AGAGCCAGAA 
10 CAGCCATTCG CTGTCGCTGA GGTTTTCACC 
TAGACTTCAG GACACCATCA AGTCCTTCAG 
ACGACCACTT GCCAGAG 



CCATCCGCTG TCGGTTACCA 750 

GCAGGAGAGA ATTACCACCA 8 00 

CCGTTTACGT CCCAGCTGAT 8 50 

TTCGCCCACT TGGACGC CAC 9 00 

GGGTATTTAC CCAGCTGTCG 950 

ACGCTGCCGT TGTTGGTGAC 10 00 

CAGACCCTTC AGGCTTACAA 10 50 

TATGGATGAG TTGTCTGAGG 1100 

AGATTCAGCG TTTCCTTTCC 115 0 

GGTATCCCAG GTAGATTGGT 1200 

AGAGGTTTTG GACGGTAAGT 1250 

1267 



15 



25 



2) INFORMATION FOR SEQ ID NO: 4 77 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1296 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida viswanathii 

(B) STRAIN: ATCC 28269 



30- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477 

CCAATTCGAC GAAGGTAGCT TGCCAGCTAT CTTGAACGCC TTGACCTTGA 5 0 

AGAACGGTGA CCAAGACTTG GTCTTGGAAG TTGCCCAACA CTTGGGTGAA 10 0 

AACACTGTCA GAGCTATTGC TATGGATGGT ACCGAAGGTT TGGTCAGAGG 150 

3 5 TACTGCTGTC ACTGACACCG GTGCTCCAAT CTCCGTCCCA GTCGGTAGAG 2 00 

GTACCTTGGG TAGAATCATC AACGTTGTTG GTGAACCAAT TGACGACAGA 250 

GGTCCAATTG AATGTAAGGA AAGAAAGC CA ATT C ACGCTG AACCACCTTC 300 

TTTCGTTGAA CAATCCACTG CTGCCGAAAT TTTGGAAACC GGTATCAAGG 3 50 

TTGTCGACTT GTTGGCCCCA TACGCCAGAG GTGGTAAGAT TGGTTTGTTC 400 

4 0 GGTGGTGCCG GTGT CGGTAA GACTGTCTTT ATCCAAGAGT TGATTAACAA 45 0 

CATTGC CAAG GCCCATGGTG GTTTCTCTGT CTTCACTGGT GTCGGTGAAA 50 0 

GAACCAGAGA AGGTAACGAT TTGTACCGTG AAATGAAGGA AACCGGTGTC 550 

ATCAACTTGG AAGGTGACTC CAAGGTCGCC TTGGTTTTCG GTCAAATGAA 60 0 

CGAACCACCT GGTGCTAGAG CTAGAGTTGC TTTGACTGGT TTGACCATTG 650 

45 CCGAATACTT CAGAGACGAA GAAGGTCAAG ATGTCTTGTT GTTTATTGAC 700 

AACATTTTCA GATTCACCCA AGCCGGTTCC GAAGTGTCTG CTTTGTTGGG 750 

TCGTATTCCA TCTGCCGTCG GTTATCAACC AACCTTGGCC AC CGATATGG 80 0 

GTCTTTTGCA AGAACGTATT ACCACCACCA AGAAGGGTTC CGTCACCTCT 850 

GTCCAAGCTG TCTATGTCCC AGCCGATGAT TTGACCGATC CTGCTCCAGC 900 

50 CACCACCTTC GCTCACTTGG ATGCCACTAC TGTCTTGTCT AGAGGTATTT 950 

CCGAATTGGG TATCTACCCA GCTGTCGATC CATTGGACTC CAAGTCCAGA 1000 

TTGTTGGATG CTGCTGTTGT TGGTCAAGAA CATTACGATG TTGCCACTGG 1050 

TGTCCAACAA ACCTTGCAAG CTTACAAGTC TTTGCAAGAT ATCATTGC C A 1100 

TTTTGGGTAT GGATGAATTG TCTGAATCCG ATAAGTTGAC TGTCGAAAGA 1150 

55 GCTCGTAAGA TCCAAAGATT CTTGTCGCAA CCATTTGCTG TTGCCGAAGT 12 00 

TTTCACTGGT ATCCCAGGTA GATTAGTTAG ATTGCTGGAA ACCGTCCAAT 1250 

CTTTCAAAGA CGTCTTGGCT GGTAAATACG ATCACTTGCC AGAAAA 12 96 



60 
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10 



2) INFORMATION FOR SEQ ID NO: 47 8 

( i ) SEQUENCE CHARACTER I STI CS : 

(A) LENGTH: 1295 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE : Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida zeylanoides 

(B) STRAIN: ATCC 7351 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478 



20 



25 



30 



35 



40 



CTCACCTTGA 


50 


CTTGGGTGAG 


100 


TGGTTAGAGG 


150 


GTTGGCCGCG 


200 


TGACGAGAGA 


250 


ACCCCCCCTC 


300 


GGTAT CAAGG 


350 


TGGTTTGTTC 


400 


TGATCAACAA 


450 


GTCGGTGAGA 


500 


GACCGGTGTC 


550 


GTCAGATGAA 


600 


TTGAC CATTG 


650 


GTTCGTCGAC 


700 


CCTTGTTGGG 


750 


ACTGATATGG 


800 


CGTCACCTCA 


850 


CTGCTCCCGC 


900 


AGAGGTATCT 


950 


CAAATCGAGA 


1000 


TTGCCTCGAA 


1050 


ATCATTGCCA 


1100 


CGTTGAGAGA 


1150 


TTGCCGAGGT 


1200 


AC CGTCAGAT 


1250 


CGAGA 


1295 



45 



55 



2) INFORMATION FOR SEQ ID NO : 4 7 9 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 534 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: . 

(A) ORGANISM: Coccidioides immitls 

(B) STRAIN: Silveira 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479 

425 
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TTCAGGAATT GATTGTAAGT TCTGTTATCA ACTAAAGCCG ACAGCGGTTG 50 

CTGATATGCT CTAGAACAAC ATTGCTAAGG CTCACGGTGG TTACTCCGTG 100 

TTCACTGGTG TCGGTGAGCG TACCCGTGAG GGTAACGATT TGTACCATGA 150 

5 AATGCAAGAG ACCCGTGTCA TTCAACTCGA CGGAGAGTCC AAGGTCGCTC 2 00 

TTGTCTTCGG TCAAATGAAC GAGCCCCCTG GTGCCCGTGC CCGTGTTGCC 2 50 

CTTAC CGGTT TGAC CATTGC TGAATACTTC CGTGACGAGG AAGGCCAAGA 3 00 

CGGTAGGCTT CATGCTTCTA TCGCTAGGGG C GTGTGATAC AGGAGGC TAA 3 50 

TCGCTTTTCT AGTGCTTCTC TTTATTGACA ACATTTTCCG TTTCACTCAA 4 00 

10 GCTGGTTCTG AAGTGTCTGC CTTGCTCGGT CGTATTCCTT CCGCTGTCGG 4 50 

TTACCAACCT ACTCTCGCCG TCGACATGGG TGTTATG C AG GAACGTATCA 500 

CCACCACCAC CAAGGGATCC ATTACTTCAG TGCA 534 



15 



25 



2) INFORMATION FOR SEQ ID NO: 480 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 94 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus albldus 

(B) STRAIN: ATCC 6603 0 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 480 

CGTCTTGATT CAAGAATTGA TCAACAACAT CGCCAAGGCC CACGGTGGTT 5 0 

ACTCGGTCTT CACCGGTGTC GGTGAGCGAA CTCGAGAGGG TAACGATCTG 10 0 

TAC CACGAAG TGAGTTGCGC CGTCCGAGTG TTTCCCGGGG AATCGCAAGA 150 

3 5 CTGATGTTGT CCCTTCTTCT CAGATGAGGG AAACCGGTGT CATCAACCTC 2 00 

GAGGGTGACT CCAAGGTCGC CTTGGTCTTC GGTCAGATGA ACGAGCCCCC 25 0 

TGGAGCCCGA GCCCGAGTCG CCTTGACCGG TTTGACCATT GCCGAATACT 30 0 

TCCGAGACGA GGAGGGTCAG GATGTCTTGT TGTTCATTGA CAACATTTTC 35 0 

CGATTCACCC AAGCCGGTTC CGAAGTGTCC GCCTTGTTGG GTCGTATCCC 40 0 

4 0 CTCCGCCGTC GGTTACCAGC CCACTCTGTC CAC CGACATG GGTACCATGC 45 0 

AGGAGCGAAT TACCACCACC AAGAAGGGTT CCATCACTTC CGTC 4 94 



4 5 2) INFORMATION FOR SEQ ID NO: 4 81 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 415 bases 

(B) TYPE: Nucleic acid 

5 0 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fu sari urn oxysporum 

(B) STRAIN: WSA-212 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 481 



60 
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TTCAGGAGCT TATCAACAAC ATCGCCAAGG 
TTCACTGGTG TCGGTGAGCG AACTCGTGAG 
AATGCAGGAG ACTTCCGTCA TTCAGCTTGA 
TGGTCTTCGG TCAGATGAAC GAGCCCCCTG 
5 CTTACCGGTC TTACTGTAGC TGAATACTTC 
CGTGCTGCTC TTCATTGACA ACATTTTCCG 
AGGTGTCTGC CCTTCTCGGT CGTATCCCCT 
ACCCTCGCCG TCGACATGGG TGGTATGCAA 
CAAGGGTTCC ATTAC 



CCCACGGTGG TTACTCCGTC 5 0 

GGTAACGATC TGTACCACGA 100 

TGGCGAGTCC AAGGTCGCCC 150 

GAGCTCGTGC CCGTGTCGCC 2 00 

AGAGATGAGG AGGGTCAGGA 2 50 

ATTCACTCAG GCCGGTTCCG 300 

CTGCCGTCGG TTACCAGCCC 35 0 

GAGCGTATTA CCACCACCAC 400 

415 



2) INFORMATION FOR SEQ ID NO: 482 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1281 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Geotrichum spp. 
25 (B) STRAIN: Lev- 4 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 482 

AGGACAAC CT CCCCGCTATT CTTAACGCTC TTGAGCTTAA GAGAGATAAC 50 

3 0 GGTGAGAAGC TCGTTCTCGA GGTTGCCCAG CATCTGGGTG AGAACACTGT 100 

CCGTACTATT GCTATGGACG GTACTGAGGG TCTCGTCCGT GGTCAGCCCG 150 

TTGTTGACAC CGGTGCCCCC ATTAC CATTC CCGTTGGTCG TGGTACTCTT 200 

GGTAGAATTA TCAACGTCAT TGGTGAGCCC ATCGATGAGC GTGGACCCAT 250 

TGAGGCTACC AAGTACCTCC CCATCCACAC CGAGCCCCCC ACCTTCGCTG 300 

35 AGCAGTCTAC CTCCGCTGAG GTTCTTGAGA CTGGTATCAA GGTTGTCGAT 3 50 

CTCCTTGCCC CCTACGCCCG TGGTGGTAAG ATTGGTCTCT TCGGTGGTGC 400 

CGGTGTCGGT AAGACCGTTT TCATTCAGGA GCTGATTAAC AACATTGC C A 450 
AGGCCCATGG TGGTTTCTCC GTTTTCACCG GTGTCGGTGA GAGAACCCGT 500 
GAGGGTAACG ATTTGTACCG TGAGATGAAG GAGAC CGGTG TCATCAACCT 55 0 

4 0 CGAGGGTGAG TCTAAGGTCG CTCTCGTTTT CGGTCAGATG AACGAGCCCC 60 0 

CTGGAGCCCG TGCCCGTGTT GCTCTTACTG GTCTTACCAT TGCTGAGTAC 65 0 

TTCCGTGATG AGGAGGGTCA GGATGTGTTG CTCTTCGTTG ACAACATTTT 70 0 

CCGTTTCACT CAGGCCGGTT CCGAGGTGTC TGCCCTTTTG GGTCGTATTC 75 0 

CCTCCGCTGT CGGTTACCAG CCCACCCTTG CCACTGATAT GGGTGCCCTG 80 0 

45 CAAGAGCGTA TTACCACCAC CCAGAAGGGT TCCGTCACTT CCGTCCAGGC 85 0 

CGTCTACGTC CCTGCCGATG ATTTGACCGA TCCTGCCCCT GCTACCACCT 90 0 

TCGCCCATTT GGATGCCACC ACCGTCTTGT CGCGTTCCAT TTCTGAGTTG 95 0 

GGTATCTACC CCGCTGTCGA TCCCCTTGAT TCCAAGTCTC GTCTTTTGGA 1000 

TATCACCGTC GTTGGCCAGG AGCACTACGA TGTTGCTACC CAGGTCCAGC 1050 

50 AGACCCTCCA GTCCTACAAG TCTCTTCAGG ATATCATTGC CATTTTGGGT 1100 

ATGGATGAGT TGTCTGAGGC TGATAAGCTT ACTGTCGAGC GTGCCCGTAA 1150 

GATCCAGAGA TTCCTTTCCC AGCCCTTCAC TGTCGCTGAG GTTTTCACTG 1200 

GTATCGAGGG CCGTCTCGTT CCTTTGAAGG ACACTGTTCG CTCTTTCAAG 12 50 

GAGATCCTTG AGGGCAAGTA CGACCACCTC C 12 81 

55 



60 



2) INFORMATION FOR SEQ ID NO: 483 
( i ) SEQUENCE CHARACTERISTICS : 

427 
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(A) LENGTH: 586 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capaulatum 
10 (B) STRAIN: G185A5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 483 

ATT CAAGAAT TGATCGTACG TTCCTCCGCC CCACACACGA TCAATGGAGA 50 

15 AAGAAACAAA TTTTTTGGGA GTGGTCATTT TTTC TAATAA TTCGAATAGA 100 

ACAACATTGC CAAAGCCCAC GGTGGTTACT CCGTGTTCAC TGGTGTCGGC 150 

GAGCGGACCC GTGAAGGAAA TGACTTGTAC CACGAAATGC AGGAAACCCG 2 00 

TGTTATCCAG CTCGATGGAG AGTCCAAGGT CGCCCTCGTT TTCGGTCAGA 2 50 

TGAACGAGCC TCCCGGAGCC CGTGCCCGTG TTGCCCTCAC TGGTCTGACC 3 00 

20 GTTGCCGAGT ACTTCCGTGA C GAGGAAGGC CAAGACGGTA TGTATAAGTA 3 50 

TACACCGTAG CAAATCAACA CAGAGCTTCA CTCACGCTCG GATTTAGTGC 4 00 

TTCTCTTCAT CGACAACATT TTCCGCTTCA CTCAGGCCGG TTCCGAAGTG 4 50 

TCTGCCCTGC TCGGCCGTAT TCCCTCCGCC GTCGGTTACC AACCCACCCT 500 

CGCCGTGGAC ATGGGTGGTA TGCAGGAACG TATCACCACT ACCACCAAGG 5 50 

25 GCTCCATCAC CTCTGTGCAR GCCGTCTACG TCCCCG 58 6 



2) INFORMATION FOR SEQ ID NO: 484 



30 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1145 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
4 0 (A) ORGANISM: Malassezia furfur 

(B) STRAIN: ATCC 42132 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 484 

45 GCGTGGCCAG AAGGTCATTG ACACTGGTGC TCCCATCACC ATCCCCGTCG 50 

GTGGTGC CAC ACTGGGTCGT ATTCTGAACG TCACGGGTGA CCCTATTGAC 100 

GAGCGTGGCC CCGTTAAGAC TGACGTTTTC CGCCCCATTC ACCGTGACCC 150 

CCCTGCCTTT GTCGAGCAGT CGACTGATGC CGAGATTCTC GAGAC TGGTA 2 00 

TCAAAGTCGT TGAC CTGATT GCCCCTTACG CCCGTGGTGG TAAGATTGGT 250 

50 CTGTTCGGTG GTGCCGGTGT CGGTAAGACC GTGCTTATCC AGGAGCTCAT 3 00 

CAACAACATC GCCAAGGCCC ACGGTGGTTT CTCCGTGTTC ACTGGTGTCG 3 50 

GTGAGCGTAC TCGTGAGGGT AACGATTTGT ACCACGAGAT GATTGAAACC 4 00 

GGTGTCATTA ACCTCGAGGG TGACTCGAAG GTGGCTCTGG TGTTCGGTCA 4 50 

GATGAACGAG CCCCCGGGTG CCCGTGCGCG TGTCGCTCTT ACTGGTCTGA 500 

55 CTGTGGCCGA GTACTTCCGT GACGACGAGG GCCAGGATGT GCTGCTGTTC 550 

ATTGACAACA TTTTCCGTTT CACCCAGGCC GGTTCGGAGA CTTCGGCTCT 6 00 

GCTGGGTCGT ATCCCTTCGT CGGTCGGTTA CCAGCCCACT TTGTCGACCG 650 

ATATGGGTGC CATGCAGGAG CGTATCAC CA CCACCAAGAA GGGTTCGATT 7 00 

ACGTCGGTGC AGGCCGTCTA CGTGCCCGCC GATGATGTCA CTGACCCTGC 750 

60 CCCTGCCACT ACCTTCGCCC ACCTTGACGC TACCACTGTG CTTGACCGTT 80 0 

428 
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CGATCGCTGA GCTGGGTATC TACCCCGCTG 

TCGCGTATGC TTGACCCCGC TATTGTGGGT 

CTCTGGCGTG CAGAAGC TGC TCCAGGACTA 

TTGCCATTCT GGGTATGGAT GAGCTTTCTG 

5 GAGCGTGCCC GTAAGATGCA GCGTTTCATG 

CCAGGTCTTT ACTGGTATCG AAGGTCGTCT 

TCAAGGCCTG CAAGGAGATC CTGTCGGGCA 



TTGACCCGTT GAACTCGAAC 850 

CAGGAGCACT ACGACGTGGC 90 0 

CAAGTCGCTT CAAGATATCA 95 0 

AGGAGGACAA GCTCACTGTC 100 0 

TCGCAGCCTT TCGCTGTCGC 1050 

TGTTGCCCTG AAGGACACGA 1100 

AG CACGACAA CCTCC 1145 



10 



2) INFORMATION FOR SEQ ID NO: 485 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 61 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malassezia pachydermatis 

(B) STRAIN: ATCC 42756 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 485 

TCCCGCCATC TTCAACGCCC TGGAGGTCCA GGACATGAAG AACGGTGGCC 50 

GCCTTGTTCT GGAGGTTGCC CAGCACCTTG GTGAGAACAC TGTTCGTTGC 10 0 

ATTGCTATGG ACGGTACCGA GGGTCTTGTC CGTGGTCAGA AGGTCCTTGA 150 

30 CACTGGTGCC CCGATCACTA TCCCTGTCGG TAACGGTACC TTGGGCCGYA 200 

TCCTGAACGT CACTGGTGAG CCTGTGGATG AGCGTGGTCC GGTTAAGACT 250 

GACGTCTACC GTCCAATCCA CCGTGAGCCC CCGGCGTTCG TTGAGCAGTC 3 00 

GACTGATGCT GAGATTCTTG AGACTGGTAT CAAGGTGGTC GACCTGCTCG 350 

CCCCATACGC TCGTGGTGGT AAGATTGGTC TWTTCGGTGG TGCYGGTGTY 400 

35 GGTAAGACCG TGCTGATTCA GGAGCTTATC AACAACATTG CCAAGGCCCA 45 0 

CGGTGGTTTC TCGGTGTTCA CTGGTGTCGG TGAGCGTACT CGTGAGGGTA 50 0 

ACGATCTGTA CCATGAAATG ATTGAGACTG GTGTCATCAA CGTTGACGGT 550 

GACTCGAAGG TCGCTCTCGT GTTCGGTCAG ATGAACGAGC CCCCGGGTGC 60 0 

CCGTGCCCGT GTCGCCCTGA CYGGTCTGAC CATCGCCGAG TACTTCCGTG 65 0 

4 0 ACGACGAGGG TCAGGATGTG CTGCTCTTCA TTGACAACAT TTTCCGTTTC 70 0 

ACTCAGGCTG GTTCGGAGAC TTCGGCTCTG CTGGGTCGTA TCCCGTCGGC 750 

TGTCGGTTAC CAGCCKACCC TTGCCACGGA TATGGGTGCC ATGCAGGARC 80 0 

GTATYACCAC CACCAAGAAG GGTTCGATTA CCTCGGTGCA GGCYGTTTAC 850 

GTTCCGGCCG ACGATGTGAC TGACCCTGCC CCGGCCACGA CCTTCGCCCA 900 

45 CTTGGACGCC ACCACGGTGT TGGAC CGTTC GATTGCYGAG CTGGGTATCT 950 

ACCCGGCCGT CGACCCGCTG AACTCGAAGT CGCGTATGCT TGACCCGTCA 100 0 

ATTGTCGGTG TGGAGCACTA CAACGTTGCT TCGGGTGTCC AGAAGCTTCT 1050 

YCAGGACTAC AAGTCGCTCC AAGATATCAT TGCCATTCTG GGTATGGATG 110 0 

AGTTGTCGGA GGAGGACAAG CTCACTGTCG AGCGTGCCCG TAAGATGCAG 1150 

50 CGTTTCCTGT CGCAGCCTTT CGCTGTGGCC CAGGTCTTCA CTGGTATCGA 12 00 

GGGTCGTCTT GTGTCGCTCA AGGACACCAT CAACGCCTGC AAGGAGATTC 1250 

TGTCCGGTAA G 12 61 



55 

2 ) INFORMATION FOR SEQ ID NO: 486 

( i ) SEQUENCE CHARACTERI STI CS : 
(A) LENGTH: 12 82 bases 
60 (B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Metschnikowia pulcherrlma 

(B) STRAIN: DSM 70336 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 486 

AGGAGGGCAA CTTGCCAGCT ATCTTCAACG CTTTGACGTT GAAGAACGGC 50 

GACCAGAAGT TGGTCTTGGA GGTGGCCCAG CACTTGGGTG AGAACACCGT 100 

CAGAACCATT GCCATGGACG GTACCGAGGG TTTGGTCAGA GGCGCCTCTG 150 

15 TCACCGACAC YGGTGCCCCT ATCTCCGTGC CTGTCGGCCG TGAGACCTTG 2 00 

GGTCGTATTA TCAATGTTGT TGGTGAGCCA ATCGACGAGA GAGGCCCAAT 250 

CAACAC CAAG AAGAGAAACC CTATTCACAC CGACCCACCT TCGTTTGTCC 3 00 

AG CAATC CAC TTCCGCCGAG GTCTTGGAGA CTGGTATCAA GGTTGTCGAC 3 50 

TTGTTGGCCC CTTACGCCAG AGGTGGTAAG ATTGGTTTGT TCGGTGGTGC 4 00 

2 0 CGGTGTCGGT AAGACCGTGT TCATC CAGGA GTTGAT TAAC AACATTGCCA 4 50 

AGGCCCACGG TGGTTTCTCC GTTTTCACCG GTGTCGGTGA GAGAACCAGA 500 

GAGGGTAACG ATTTGTACCG TGAAATGCAG GAGACTGGTG TCATCAACTT 550 

CGAGGGT GAC TCCAAGGTCG CCTTGGTTTT CGGTCAGATG AACGAGCCTC 600 

CAGGAGCTAG AG C Y AGAGTT GCCTTGACCG GTTTGACCAT CGCCGAGTAC 650 

2 5 TTCAGAGACG AGGAGGGTCA GGATGTGTTG TTGTTCGTCG ACAACATTTT 7 00 

CAGATTC AC C CAAGC CGGTT CTGAGGTGTC TGCCTTGTTG GGTCGTATTC 750 

CATCCGCTGT CGGATACCAG CCAACCTTGG CCACCGATAT GGGTGCCTTG 8 00 

CAGGAGAGAA TTACCACCAC CAAGAAGGGT TCCGTCACCT CCGTCCAGGC 850 

CGTCTACGTG CCAGCCGATG ACTTGACTGA CCCTGCGCCA GCCACCACTT 900 

3 0 TCGCCCACTT GGACGCCACC ACTGTGTTGT CCAGAGGTAT CTCTGAGTTG 95 0 

GGTATCTACC CCGCTGTCGA CCCCTTGGAC TCCAACTCCA GATTGTTGGA ' 100 0 

CGCCACCGTT GTTGGCCAGG AGCACTACGA CGTCGCCACC AACGTCCAGC 1050 

AAACTTTACA AG CTTACAAG TCCTTGCAGG ATATCATTGC CATTTTGGGT 1100 

ATGGATGAGT TGTCCGAGAC CCGACAAGTT GACCCGTCGA GAGAGCCAGA 1150 

3 5 AAGATC CAGA AGTTCTTGTC CCAGCCATTT GCCGTCGCCG AGGTTTTCAC 12 00 

CGGTATTGAG GGTAGATTGG TTAGATTGGA GGACAC CGTT AGATCCTTTA 1250 

AGGAGGTTTT GGAAGGTAAG TACGACCACT TG 12 82 



40 

2) INFORMATION FOR SEQ ID NO: 487 

( i > SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 482 bases 
45 (B) TYPE: Nucleic acid 

,. (C) . STRANDEDNESS :. .Double 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penlcillium marneffel 

(B) STRAIN: WSA-214 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 487 

TGTCTTTATC CAGGAGTTGA TTGTACGTCT TTACCTTTCT GCCTGACTGT 
TTACGACAAC TAACGAAAGC GTAGAACAAC ATTGC CAAGG CTCACGGTGG 
TTACTCTGTC TTCACTGGTG TCGGTGAACG TACTCGTGAG GGTAACGATT 
6 0 TGTACCACGA AATGCAGGAA ACTGGTGTCA TTCAGCTCGA GGGTGAATCC 



430 
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AAGGTCGCCC TCGTGTTCGG 
CCGTGTCGCT CTTACTGGTT 
AAGGTCAGGA CGTGCTTCTC 
GCCGGTTCTG AGGTGTCTGC 
5 TTACCAGCCC ACCCTTGCCG 
CCACCACCAC CAAGGGTTCC 



TCAGATGAAC GAGCCCCCCG 
TGAC CATTGC CGAGTACTTC 
TTCATTGACA ACATTTTCCG 
CCTTCTGGGT CGTATCCCCT 
TCGACATGGG TATCATGCAG 
ATCACCTCCG TC 



GTGCCCGTGC 2 50 

CGTGACGAGG 3 00 

TTTCACTCAG 3 50 

CTGCCGTCGG 4 00 

GAGCGTATTA 4 50 

482 



10 2) INFORMATION FOR SEQ ID NO: 488 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1290 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 18205 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 488 



TCGAACAAGG TAACTTACCA GCTATCTTGA ATGC TTTAGA AATCAAAACC 50 
C CAAGTGGTA CTCCATTAGT TTTAGAAGTT GCTCAACATT TAGGTGAAAA 100 
CACTGTCAGA ACTATTGCTA TGGATGGTAC TGAAGGTTTA GTTCGTGGTG 150 

AACAAGTTAC TGATACTGGT TCTCCAATCA CTGTCCCAGT TGGTCGTGAA 2 00 

3 0 ACTTTAGGTC GTATTATCAA CGTTGTTGGT GAACCAATTG ATGAAC GTGG 2 50 

TCCAATTAAC ACCAAACAAA GAAACCCAAT TCACGCTGAA CCACCTTCAT 300 

TCAGTGAACA AT CAACTGCT GCTGAAGTTT TAGAAACTGG TATCAAAGTT 350 

GTYGATTTAT TAGCTCCATA CGCTAGAGGT GGTAAAATTG GTTTATTCGG 400 

TGGTGCCGGT GTCGGTAAAA CTGTCTTTAT CCAAGAATTG ATTAACAACA 450 

3 5 TTGCTAAAGC TCATGGTGGT TTCTCAGTTT TCACCGGTGT TGGTGAAAGA 500 

ACCAGAGAAG GTAACGATTT ATACCGTGAA ATGAAAGAAA CTGGTGTTAT 550 

TAACTTGGAA GGTGATTCTA AGGTCGCTTT AGTTTTCGGT CAAATGAATG 60 0 

AACCACCAGG TGCTAGAGCT CGTGTTGCTT TAACTGGTTT GACCATTGCT 650 

GAATAC TTCA GAGATGAAGA AGGTCAAGAT GTCTTGTTAT TCGTTGATAA 70 0 

4 0 CATTTTCAGA TTCACCCAAG CCGGTTCAGA AGTTTCTGCC TTATTAGGTC 75 0 

GTATTCCATC TGCTGTCGGT TATCAACCAA CTTTAGCAAC TGATATGGGT 800 

TTGTTACAAG AACGTATTAC CACCACACAA AAAGGTTCAG TTACTTCTGT 850 

CCAAGCTGTT TATGTCCCAG CTGATGATTT AACAGATCCT GCTCCAGCTA 900 

CCACTTTCGC CCATTTGGAT GCTACTACTG TCTTGTCTCG TGGTATTTCA 950 

4 5 GAATTAGGTA TTTACCCAGC TGTCGATCCA TTAGATTCTA AATCAAGATT 10 0 0 

ATTAGATGCT TCAGTTGTTG GTCAAGAACA TTATGATGTT GCTACCAACG 1050 

TTCAACAAAC TTTACAAGCT TACAAATCTT TACAAGATAT TATTGCTATT 1100 

TTAGGTATGG ATGAATTGTC TGAACAAGAT AAATTGACTG TCGAAAGAGC 115 0 

AAGAAAAATC CAAAGATTCT TATCTCAACC ATTTGCTGTT GCCGAAGTTT 12 0 0 

50 TCACTGGTAT YCCAGGTAGA TTGGTTAGAT TAAAAGACAC TATCAAAT CA 12 50 

TTCAAAGATG TTTTGGAAGG TAAATATGAT CACTTAC CAG 1290 



55 2) INFORMATION FOR SEQ ID NO: 489 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 91 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 2149 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 489 

10 

CCAATTCGAA CAAGGTAACT TACCAGCTAT CTTGAATGCT TTAGAAATCA 50 

AAACCCCAAG TGGTACTCCA TTAGTTTTAG AAGTTGCTCA ACATTTAGGT 100 

GAAAACACTG TCAGAACTAT TGC TATGGAT GGTACTGAAG GTTTAGTCCG 150 

TGGTGAACAA GTTACTGATA CTGGTTCTCC AATCACTGTC CCAGTTGGTC 2 00 

15 GTGAAACTTT AGGTCGTATT ATCAACGTTG TTGGTGAACC AATTGATGAA 250 

CGTGGTC CAA TTAACACCAA ACAAAGAAAC CCAATTCACG CTGAACCACC 3 00 

TTCATTCAGT GAACAATCAA CTGCTGCTGA AGTTTTAGAA ACTGGAATCA 3 50 

AAGTTGTTGA TTTATTAGCT CCATACGCTA GAGGTGGTAA AATTGGTTTA 4 00 

TTCGGTGGTG CCGGTGTCGG TAAAACTGTC TTTATCCAAG AATTGATTAA 450 

20 CAACATTGCT AAAGCTCATG GTGGTTTCTC AGTTTTCACC GGTGTTGGTG 500 

AAAG AAC GAG AGAAGGTAAC GAT TTATAC C GTGAAATGAA AGAAACTGGT 55 0 

GTTATTAACT TGGAAGGTGA TTCTAAGGTC GCTTTAGTTT TCGGTCAAAT 60 0 

GAATGAACCA CCAGGTGCTA GAGCTCGTGT TGCTTTAACT GGTTTGACCA 650 

TTGCTGAATA CTTCAGAGAT GAAGAAGGTC AAGATGTCTT GTTATTCGTT 700 

2 5 GATAACATTT TCAGATTCAC CCAAGCCGGT TCAGAAGTTT CTGCCTTATT 750 

AGGTCGTATT CCATCTGCTG TCGGTTATCA ACCAACTTTA GCAACTGATA 800 

TGGGTTTGTT ACAAGAACGT ATTACCACCA CACAAAAAGG TTCAGTTACT 850 

TCTGTCCAAG CTGTTTATGT CCCAGCTGAT GATTTAACAG ATCCTGCTCC 90 0 

AGCTACCACT TTCGCCCATT TGGATGCTAC TACTGTCTTG TCTCGTGGTA 950 

3 0 TTTCAGAATT AGGTATTTAC CCAGCTGTCG ATC CATTAGA TTCTAAATCA 1000 

AGATTATTAG ATGCTTCAGT TGTTGGTCAA GAACATTATG ATGTTGCTAC 1050 

CAACGTTCAA CAAACTTTAC AAGCTTACAA ATCTTTACAA GATATTATTG 1100 

CTATTTTAGG TATGGATGAA TTGTCTGAAC AAGATAAATT GACTGT CGAA 1150 

AGAGCAAGAA AAATCCAAAG ATTCTTATCT CAACCATTTG CTGTTGCCGA 12 0 0 

3 5 AGTTTTCACT GGTATCCCAG GTAGATTGGT TAGATTAAAA GACACTATCA 12 50 

AATCATTCAA AGATGTTTTG GAAGGTAAAT AT GAT CACTT A 1291 



40 2) INFORMATION FOR SEQ ID NO: 490 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 508 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhodotorula minuta 

(B) STRAIN: ATCC 10658 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 90 



CGTATTGATT CGTGAGTGGC CGTTCCCTTA CAGCAAGCTT ATAAAGGAGC 5 0 

GAAAAAGATC TGACATTCGG CTTATGTGCT ATACAGAGGA ACTCATCAAC 100 

AACGTCGCCA AGGCTCATGG TGGTTACTCT GTCTTCACCG GTGT CGGAGA 15 0 

GCGAACACGT GAAGGTAACG ATCTCTACCA CGAAATGATT GAAACCGGTG 200 

60 TCATTCAGCT CAAGAACGAC AAGTCCAAGG CCGCTCTGGT CTTCGGACAG 250 



432 
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ATGAACGAGC CCCCCGGAGC TCGTGCCCGT 
CATCGCCGAG TACTTCCGTG ACGTCGAAGG 
TCGACAACAT TTTCCGATTC ACCCAGGCCG 
CTCGGACGTA TCCCATCTGC TGTCGGATAC 
5 TATGGGTGGT ATGCAAGAGC GAATCACAAC 
CCTCCGTC 



GTCGCTCTGA CCGGTCTCAC 300 
ACAGGATGTG CTACTCTTCA 35 0 

GTTCAGAGGT ATCTGCCCTG 40 0 

CAGCCCACAC TCTCAACCGA 4 50 

CACCAAGAAG GGTTCGATTA 50 0 

508 



10 2) INFORMATION FOR SEQ ID NO: 491 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 686 bases 

(B) TYPE: Nucleic acid 
15 (C) STRAND EDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhodotorula mucilaginosa 

(B) STRAIN: ATCC 66034 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 491 



TGTC CTCATT CAGGAG CTCA TCAACAACAT CGCCAAGGCC CACGGTGGTT 50 

ACTCGGTCTT CACCGGTGTC GGCGAGCGTA CCCGTGAGGG TAACGGTGAG 100 

TCTCCCCCTT CAAACTTTTG GCCGGCTAGT TGGCGCAGCG CAAACTGACG 150 

CGCGCGCCCT GTCCAGACTT GTACCACGAG ATGATCGAGA CTGGTGTCAT 200 

3 0 CCAGCTCGAG AACGACAACT CGAAGTGCGC TCTCGTGTTC GGC CAGATGA 250 

ACGAGCCCCC TGGTGCCCGT GCCCGTGTCG CTCTCACTGG GTTCGTCCTT 3 00 

TCTCTCTCTC GAGCGTCCTG GCT TGATACG GAACGCTGAC ACGTCACGCA 350 

GTCTCACTAT TGCTGAGTAC TTCCGTGACG AGGAGGGCCA GGACGTGCTC 4 00 

CTCTTCATCG ACAACATCTT CCGTTTCACC CAGGGTGAGC CGCCTCCGCG 450 

3 5 GGCATTCTCC CGTTTCTTTC GCGCTGACGT CTGTC CCGTA TAGCCGGTTC 500 

GGAGGTGTCT GCCCTTCTCG GACGTATCCC GTC CGCTGTC GGATACCAGC 550 

CGACTCTCTC GACCGACATG GGTCAGATGC AGGAGCGTAT CACGTAAGTT 60 0 

TGGCCGCAGC TCCGTCCGCG GCGCCCTTTG TGTCTGACCG TGTTCCACCG 650 

CTCAGCACCA CCAAGAAGGG CTCGATCACC TGTGTC 68 6 

40 



2) INFORMATION FOR SEQ ID NO: 492 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 625 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporobolomyces salmonlcolor 
55 (B) STRAIN: ATCC 32311 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 492 

TGTCTTGATT CAGGAGCTCA TCAACAACGT CGCCAAGGCG CACGGTGGTT 50 
60 ACTCCGTTTT CACCGGTGTC GGTGAGCGTA CCCGTGAGGG TAACGGTGAG 100 

433 
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CACACTTTCG CCGACCTGGC 
ACCTCGAACA GATCTCTACC 
TCGACAACGA CAAGTCGAAG 
CCCCCTGGCG CCCGTGCCCG 
5 GTACTTCCGT GACGAC GAGG 
TCTTCCGTTT CACCCAGGGT 
TCGTGGTGAC TGACAACCTG 
CCCTTCTCGG TCGTATCCCG 
ACCGACATGG GTGGCATGCA 
10 TCTCTCGTTT CGCTCTGCAT 
CAAGAAGGGT TCGATCACCT 



CATTCCGGAA CGTGCGGACT 
ACGAGATGAT CGAGACCGGT 
ACTGCTCTCG TCTTCGGCCA 
TGTCGCTCTT ACTGGTCTCA 
GCCAGGACGT GCTTCTCTTC 
ACGTTCGATC CCGCCCGTCC 
TTGCGCGTGC AGCCGGTTCG 
TCCGCTGTCG GATACCAGCC 
GGAGCGTATC ACGTACGCCC 
CGCTCACGCA TGTTCGCCCG 
GTGTC 



GACGAGGACC 150 

GTCATTCAGC 2 00 

GATGAACGAG 2 50 

CCATCGCGGA 3 00 

ATCGACAACA 3 50 

AAC AC GAATG 4 0 0 

GAGGTGTCTG 4 50 

CACTCTCTCG 5 00 

TCTTCTGCTT 5 50 

ACAGCACCAC 6 00 
625 



15 2) INFORMATION FOR SEQ ID NO: 493 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1211 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckii 

(B) STRAIN: WSA-148 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 493 

TCTCGTCCGT GGTGCTAAGG CCACTGACAC TGGTTCCCCC ATTACCATCC 50 

CCGTCGGCCC CGGTACCCTC GGTCGCATCA TGAACGTCAC CGGTGACCCG 100 

ATCGACGAGC GCGGTCCCAT CAAGACCGAC AAGTTCCGTC CCATCCACGC 150 

3 5 TGAGGCTCCC GAGTTCGTTG AC CAGTCGAC CAC CGCTGAG GTTCTCGTGA 2 00 

CTGGTATCAA GGTCGTCGAT CTGCTTGCTC CCTACGCCCG TGGTGGTAAG 250 

ATTGGTCTGT TTGGCGGTGC CGGTGTTGGC AAGACCGTGT TCATC CAGGA 300 

GCTCATCAAC AACATCGC CA AGGCCCACGG TGGTTACTCC GTCTTCACCG 350 

GTGTCGGCGA GCGTACCCGT GAGGGTAACG ATCTGTACCA CGAAATGCAG 4 00 

4 0 GAGACCTCTG TCATTCAGCT TGACGGTGAC TCCAAGGTCG CCCTGGTGTT 450 

CGGTCAGATG AACGAGCCCC CTGGTGCTCG TGCCCGTGTC GCCTTGACCG 500 

GTTTGACTGT CGC TGAGTAC TTCCGTGACG AGGAGGGC CA GGATGGTATG 550 

TTTTGAATTA TTTCCTTGTC GTACAGTTCC AAATCGAAGA ATTACTAACT 600 

TGTCAGTGCT TCTCTTCATC GACAACATTT TCCGCTTCAC CCAGGCCGGT 65 0 

4 5 TCTGAGGTGT CTGCCCTTCT GGGTCGTATT CCCTCCGCTG TCGGTTACCA 700 

GCCCACGCTC GCCGTGGACA TGGGTCTGAT GCAGGAG CGT ATTACCACCA 750 

CCCGCAAGGG CTCAATTACC TCCGTCCAGG CCGTCTACGT GCCCGCTGAC 8 00 

GATCTGACGG ATCCCGCCCC CGCCACCACC TTCGCCCATC TGGACGCCAC 8 50 

CACTGTGCTG TCCCGAGGTA TCTCTGAGCT GGGTATCTAC CCCGCTGTCG 900 

50 ACCCCCTCGA CTC CAAGTCG CGTATGCTGG ACCCCCGTAT TGTCGGTGAC 950 

GAC CACTACG AGACCGCCAC TCGCGTCCAG CAGATCCTCC AAGAGTACAA 1000 

GTCGCTGCAG GACATCATCG CCATTCTGGG TATGGACGAG CTGTCTGAGG 10 50 

CCGACAAGCT TACAGTCGAG CGTGCTCGTA AGATCCAGCG TTTC CTGAGC 110 0 

CAGCCGTTCA CGGTCGCGCA GGTCTTCACT GGTATCGAAG GCCAGCTGGT 115 0 

55 CGATCTGAAG GACAC TATCG CTTCGTTCAA GGCTATCCTG AG CGGTGAGG 12 0 0 

GTGACAGCCT T ^ 12 11 



60 2) INFORMATION FOR SEQ ID NO: 4 94 

434 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1133 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Stephanoascus ciferrii 

(B) STRAIN: ATCC 52550 



15 



40 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 494 



TCTTGTTAGA GGCACCCCAG TCAAGGACAC TGGTGCTCCA ATTACCATTC 50 

CAGTTGGTAA CGGCACTTTG GGCCGTATCG TCAACGTTCT CGGTGAGCCA 100 

ATTGATGAGC GTGGAC CAGT CAAGGCTGAC AAGTTCAGAC CTATTCACGC 150 

TGAGCCACCA ACCTTCGCTG ACCAGTCCAC CTCTGCCGAG GTTCTTGAGA 2 00 

2 0 CCGGTATTAA GGTTGT CGAC TTGCTTGCCC CTTATGC C AG AGGTGGTAAG 25 0 

ATTGGTCTTT TCGGTGGTGC CGGTGTCGGT AAGACTGTGT TCATTCAGGA 3 00 

GCTTATTAAC AACATTGCTA AGGCC CACGG TGGTTACTCT GTCTTCACTG 35 0 

GTGTCGGTGA GCGAACTCGT GAAGGTAACG ATTTGTACCA CGAAATGATG 4 00 

GAGACCGGTG TCATCAACCT TGAGGGTGAC TCCAAGGTGT CTCTTGTGTT 450 

25 CGGTCAGATG AACGAGCCTC CAGGAGCCCG TGCCCGTGTT GCCTTGACCG 5 00 

GTTTGACCAT TGCCGAGTAC TTCAGAGATG AGGAGGGC CA GGATGTCTTG 55 0 

TTGTTCATTG ACAACATTTT CCGATTCACC CAGGCCGGTT CTGAGGTCTC 600 

TGCCTTGTTG GGTCGTATCC CATCTGCCGT CGGTTACCAA CCAACCTTGG 65 0 

CTACTGATAT GGGTGGTCTT CAAGAACGTA TTACCACCAC TCAAAAGGGT 700 

30 TCCGTCACCT CTGTCCAGGC TGTCTACGTC CCAGCTGACG ATTTGACTGA 750 

TCCTGCCCCA GCTACCACCT TCGCCCATTT GGACGCCACC AC CGAATTGT 800 

CCCGATCTAT CTCTGAGTTG GGTATCTACC CAGCTGTCGA CCCTCTTGGT 850 

TCCAAGTCCC GTCTTTTGGA TGCCTCCGTC GTCGGCCAAG AGCACTACGA 90 0 

CGTTGCCGCC AACGTCCAAC AGACCTTGCA GGCCTACAAG TCTCTCCAGG 95 0 

3 5 ATATCATTGC C ATT TTGGGT ATGGACGAAT TGTCTGAGGC TGATAAGCTC 10 0 0 

ACTGTCGAGC GTGCTCGTAA GATGCAGAGA TTCCTTTCTC AGCCATTCAC 1050 

CGTCGCTGAG GTCTTCACTG GTCTCGAGGG TAGACTCGTT TCTTTGAAGG 110 0 

ACACCATCCG ATCCTTCAAG GAGATCCTTG ACG 113 3 



2) INFORMATION FOR SEQ ID NO: 495 



(i) SEQUENCE CHARACTERISTICS : 

4 5 (A) LENGTH: 60 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: WSA-225 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 495 



GGAGTTGATT GTAAGTCATT TGAAACCCAG CCCCAAGAAA CAGAAGCTAG 50 
GCGAAAATTG GACAATTGAG CAATTTAGCC ATTGGAGAAA AGAAATTTCG 100 
6 0 AGTATTAATT GTTTTTATAG AACAACATTG C CAAGGCTCA CGGTGGTTAC 150 



435 
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TCTGTCTTCA CTGGTGTCGG AGAGCGTACC CGTGAAGGAA ACGATCTCTA 200 

CCATGAAATG CAGGAGACCC GTGTCATTCA GCTTGATGGC GAGTCCAAGG 250 

TCGCCCTGGT CTTCGGCCAG ATGAACGAGC CCCCAGGTGC CCGTGCCCGT 3 00 

GTTGCTCTTA CTGGTTTGAC CATTGCTGAG TACTTCCGTG ATGAGGAAGG 350 

5 TCAAGACGGT GAGTTTCTTA TGGATAAAAA AAAATTTTTT TTTTTTTTTT 4 00 

TTTTTTTCAA GAAATTCATG TTCTAACAAA GTGTATTCTA GTGCTTCTCT 4 50 

TCATCGACAA CATTTTCCGT TTCACTCAGG CTGGTTCCGA AGTGTCTGCC 500 

CTGCTTGGTC GTATCCCATC TGCCGTCGGT TACCAACCCA CTCTTGCCGT 550 

CGACATGGGT GGTATGCAGG AACGTATTAC CACCACCAAG AAGGGAT C CA 60 0 
10 TTACCTCC 



15 



50 



2) INFORMATION FOR SEQ ID NO: 496 



2) INFORMATION FOR SEQ ID NO: 4 97 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1148 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Yarrowi* lipolytics 



608 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 794 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Wangiella. dermatitidis 

(B) STRAIN: WSA-229 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496 

3 0 GTTTATTCAA GAACTCATTG TGGGTGGCAT TCTCATAATG TTTCGGCCAC 50 

AATTACTGAT TGAAAATAGA ACAACATTGC AAAGGCTCAT GGTGGTTACT * 100 

CCGTGTTCAC TGGTGTCGGC GAGCGAACTC GTGAGGGTAA CGACTTGTAC 150 

CACGAAATGC AGGAGAC CTC TGTCATTCAG CTCGATGGCG AGTCCAAGGT 2 00 

CGCGCTGGTG TTTGGTCAAA TGAACGAACC TCCTGGTGCT CGTGCTCGTG 250 

35 TTGCTCTGAC TGGGTAAGTT GTTCCTTCGC TTCTTGCGCG TATCCACATC 3 00 

CCCATCTTGA GAATACGTCT GCCACCATGT CATGTGATGT TGGGCTGGTT 35 0 

CTGGTTTTTG GGAGGCCCTC AAGTTCAATT TTTGGATGAC AGCACCAGCT 4 00 

TTACAAGATT ATGCTAACTT AATGGAGTCT TACGGTGGCT GAGTTCTTCA 450 

GGGATGAGGA GGGACAGGAT GGTAAGTTTG ATAACAATCT CGTCGGTGTC 500 

4 0 AATATCGACG GCGTACTCTT CGCAT CAAAA AACCAAAGAG GTGGTTTGGT 550 

GTGAGAAGTG CGCCGGAAAT AATGGCAACC ACGTGACAAT GACCACGTGT 600 
GGGGCTCCCG TGCTAACACG TGACAGTCTT GCTCTTCATC GACAACATTT 650 
TCCGATTCAC TCAASCCGGT TCTGARGTGT CTGCCTTGCT TGGTCGTATT 700 
CCATCTGCCG TCGGTTACCA ACCCACACTC GCCGTCGACA TGGGTCTCAT 
4 5 GCAGGAACGT ATCACCACCA CCCGGAAGGG ATCCATCACA TCTG 



750 
794 
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(B) 



STRAIN: ATCC 382 95 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 497 



5 TCTTGTCCGA 
CCGTCGGCCG 
ATTGACGAGC 
TGACCCCCCT 
CCGGTATTAA 

10 ATTGGTCT CT 
GCTGATTAAC 
GTGTCGGTGA 
GAGACTGGTG 
CGGTCAGATG 

15 GTCTGACCAT 
CTCTTCGTTG 
CGCTCTGCTT 
CCACCGATAT 
TCCGTCACTT 

2 0 TCCTGCTCCC 
CCCGAGGTAT 
TCCAAGTCTC 
TGTTGCTTCC 
ATATCATTGC 

2 5 ACCGTCGAGC 
CGTCGCCGAG 
ACACTGTCCG 



GGCACCGCCG 
AGGTACCCTT 
GAGGACCCAT 
ACCTTCGCTG 
GGTCGTCGAC 
TCGGTGGTGC 
AACATTG CCA 
GCGAACCCGA 
TCATCAACCT 
AACGAGCCTC 
TGCCGAGTAC 
ACAACATTTT 
GGTCGAATTC 
GGGTGCCCTC 
CCGTCCAGGC 
GCCACCACCT 
TTC CGAGCTG 
GACTTCTGGA 
AACGTCCAGC 
CATTCTTGGT 
GAGCTCGAAA 
GTTTTCACCG 
ATCCTTCAAG 



TCGCTGACAC 
GGTCGAATCA 
CGAGGCTTCC 
AGCAGTCTAC 
CTCCTCGCCC 
CGGTGTCGGT 
AGGCCCATGG 
GAGGGTAACG 
CGAGGGTGAG 
CCGGAGCCCG 
TTCCGAGACG 
CCGATTCACC 
CCTCCGCTGT 
CAGGAGCGAA 
CGTCTACGTG 
TCGCCCATCT 
GGTATCTACC 
TATCGATGTT 
AGACCCTCCA 
ATGGATGAGC 
GATCCAGCGA 
GTATTGAGGG 
GAGATCCTTG 



CGGTGCTCCC 
TCAACGTCTG 
AAGTACCTCC 
CTCCGCTGAG 
CTTACGCCCG 
AAGACTGTCT 
TGGTTTCTCC 
ATCTTTACCG 
TCTAAGGTCA 
TGCCCGAGTC 
AGGAGGGTCA 
CAGGCCGGTT 
CGGTTACCAG 
TTACCACCAC 
CCTGCCGATG 
TGACGCCACC 
CCGCTGTCGA 
GTCGGAAAGG 
GGCTTACAAG 
TGTCCGAGCA 
TTCCTGTCTC 
ACGACTTGTC 
ACGGTAAGCA 



ATCACTATCC 50 
TGGTGAGCCC 100 
CCATCCACGC 150 
GTTCTCGAGA 2 00 

AGGTGGTAAG 250 
T CAT C CAGGA 3 00 

GTTTTCTGCG 350 
AGAGATGAAG 4 00 

CCCTCGTCTT 450 
GCCCTTACTG 500 
GGATGTGTTG 55 0 

CCGAGGTGTC 60 0 

CCCACTCTGG 650 
C CAGAAGGGT 70 0 

ATTTGACCGA 750 
ACCGTCCTGT 800 
TCCCCTTGAT 850 
AGCACTACGA 900 
TCTCTCCAGG 950 

GGACAAGCTG 1000 

AGCCCTTCAC 1050 

TCTCTCAAGG 1100 

CGATGCTC 114 8 



30 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 498 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 966 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: WSA-172 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 498 



GCGCTATTGT 
CTTGGGAAAT 
CCAGACTCGT 

50 TCGTTGTCTT 
GAACTGGTCG 
GGGTGAAGAG 
GACGCCGTGA 
GTCGACACCT 

55 GATGTCTGTC 
CTGGTCGTGT 
GTTGGAGGCT 
CTTCAAGAAG 
TCCTGCGTGG 

60 GTTCCCGGCA 



CGTTGTTGCT 
ACGTCGTAAA 
GAGCATTTGC 
CGTCAACAAA 
AACTCGAGAT 
ACTCCGATCA 
CGACAT CGGT 
GGATCCCCAC 
GAGGAAGTGT 
CGAGCGTGGT 
CCTTCGAACC 
AGCTGTGATG 
TATCCGACGT 
GCACCAAGGC 



GCCTCCGACG 
ACACGTCGCT 
TGCTCGCCCG 
ATCGATGCTA 
GCGTGAGCTG 
TTTTCGGTTC 
AAAGACAGAA 
TCCTCAGCGT 
TCTCTATCGC 
ATCTTGAAGA 
CAAGAAGACC 
AATCGCGTGC 
GAAGACGTCA 
TCACGACAAG 



GTCAGATGTA 
TACGTTTTCG 
CCAGGTTGGT 
TTGATGATCC 
CTGAACAGCT 
CGCTCTCTGT 
TTGAGCAGCT 
GACCT CGACA 
CGGCCGTGGT 
AGGACTCTGA 
AAAGTCACCG 
TGGTGACAAC 
AGCGTGGTAT 
TTCCTCGTCT 



GGTGGAACAT 50 

CGAATAGGCC 100 

GTCCAGAAGA 150 

GGAGATGCTG 200 

ACGGTTTCGA 250 

GCTCTCGAAG 300 

TATGAACGCT 350 

AACCTTTCTT 4 00 

ACCGTGGCTT 4 50 

GGTTGAGATT 5 00 

ACATTGAAAC 550 

TCTGGTCTCC 6 00 

GGTCATTGCT 650 

CCATGTACGT 7 00 
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CCTGACCGAG GCGGAGGGTG 
GTCCC CAAGT CTTCATCCGT 
AGATCTTCCG AG AGATT AG C 
TGCTGACCTC AGCTTCCCTG 
5 GTGACAACGT CGAGATGATC 
GCTGGTCAAC GCTTCA 



GTCGTCGTAC TGGCTTCGGT 
ACTGCAGGTA AGTTCCCGCA 
GAT ATATGC T AATGATTCAT 
ACGGCGACCA ATCTCGCAGA 
CTGAAGACCC ACCACCCTGT 



GCCAACTACC 750 
CACCGTGTCC 8 0 0 

CAGACGAGGC 8 50 

GTTATGCCTG 9 00 

TGCTGCTGAG 950 
966 



10 2) INFORMATION FOR SEQ ID NO: 499 

( i ) SEQUENCE CHARACTER I STICS : 

(A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastoschizomyces capitatus 

(B) STRAIN: ATCC 10663 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 499 



TGGTGCTATT ATTGTTGTTG CTGCTTCTGA TGGTCAAATG CCCCAAACCC 5 0 

GTGAGCACTT GCTTCTTGCT CGTCAAGTTG GTGTTAAGCA CATTGTTGTT 100 

TTCGTTAACA AGATTGATAC TATTGATGAT CCTGAAATGT TGGAACTTGT 15 0 

CGAAATGGAA ATGAGAGAAC TTCTTTCTTC TTACGGTTTT GATGGCGATA 2 00 

3 0 ACACCCCTGT CATTATGGGT TCTGCTCTCT GTGCTCTTGA AGGTCGTGAA 250 

CCAGAAATTG GTGAACAAAG AATCAACCAA CTCCTTGATG CTATCGATGA 3 00 

ATACATTCCT ACCCCAGTTC GTGATATGGA CCAACCTTTC TTGATGCCAC 350 

TTGAAGGTGT TTTC TCTATT CCAGGTCGTG GTACTGTTGC CAC TGGACGT 400 

GTCTATCGTG GTACTTTGAA GAGAGGTGAA GAAGTTGAAG TTGTTGGCTA 450 

3 5 CAATGATGCT CCAATCAAGA CCACCGTTAC TGGTATTGAA ATGTTCAAGA 50 0 

AGGAAC TTGA TCAAGCTCAA GCTGGTGACA ACGCTGGTAT TCTTTTGAGA 55 0 

GGTGTTAAGC GTGAAGACCT TAAGCGTGGT ATGGTTGTTG CTAAAC CAGG 60 0 

TAC CGTTAAG CCACACACCA AGTTCCTTGC CTCCATCTAT GTTTTGACTA 65 0 

AGGAAGAAGG TGGCAGACAC TCTGGCTTTG GTCTTAACTA CAGACCTCAA 700 

4 0 CTTTTCCTTG GTTCTGCTGA TGTTACCACT GTCTTGACCT TCCCAGAGGG 750 

TGTTGACCAA AGCACTCAAG TCATGCCAGG TGACAACACT GAAATGGTTT 800 

GCGAACTTGT TCACCCAGTT GCTGTCGAAC AAGGC CAACG TTTCAA 84 6 



45 

2) INFORMATION FOR SEQ ID NO: 500 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 846 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida rugosa 

(B) STRAIN: ATCC 96275 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 500 
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GGTGCCATTA TTGTTGTTGC TGCCTCTGAT GGACAGATGC CCCAGACCCG 5 0 

TGAGCATCTT TTGCTTGCCC GCCAAGTCGG TATGCAAAAG GTCGTTGTGT 100 

TTGTTAACAA GATTGATACC ATTGATGACC CTGAAATGCT TGATCTTGTC 150 

5 GAGATGGAGA TGCGTGAACT GTTGAATGAA TATGACTTCG ATGGAGATAA 2 00 

CTCTCCTGTC ATTATGGGCT CTGCTCTTGC TGCTCTTGAG GACAAGAACC 2 50 

CCGAGATTGG TAAGGAC CGT ATCATGCAGC TCTTGGACGC TGTTGATGAA 3 00 

TGGATCCCTA CCCCCGAGCG TGACCTTGAC AAGCCTTTCA TGATGCCTAT 3 50 

TGAGGCCTCT TTC TCCATTT CTGGTCGTGG TACTGTTGCC ACTGGCCGTG 400 

10 TCGAGCGTGG TATTCTCAAG AAGGGTGAGG AAGTCGAGAT CGTTGGTTTC 450 

AACAAGCAGC CCCTGAAATC TGTTGTTACT GGTATTGAAA TGTTCAAGAA 50 0 

GGAACTTGAT CAGGCCCAGG GCGGTGATAA TGCTGGTATC TTGCTTCGTG 550 

GTATTCGTCG TGAGGACTTG CAGCGTGGTA TGGTTTTGGC CAAGCCTGGA 600 

ACTGTTAAGG CTCACACCAA GTTCCTTTCC TCCATCTACG TTCTCTCCAA 650 

15 GGAAGAGGGC GGCCGTCACT CTCCTTTCGG TATGAACTAT CGTCCCCAGA 700 

TGTTCGTTTC TGCAGCTGAT GTCACCGTTA CTCTTACTTT CCCTGAGGGT 750 

GTTGAACAGC ACACTCAGGT CTTCCCTGGT GAGAACACCG AGATGGTTGG 8 00 

CGAGCTCGTT CACCCTACTG CTATTGAGGT TGGTCAACGC TTCAAC 84 6 



20 



35 



2) INFORMATION FOR SEQ ID NO: 501 



( i ) SEQUENCE CHARACTERI STI CS : 
25 (A) LENGTH: 944 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coccidioides immitis 

(B) STRAIN: Silveira 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501 



AGTTGTCGTC GTTGCTGCTT CAGACGGTCA AATGTATGCA AC CGAGAGCA 50 

CTCCCGGATC TTGGTTTAAA TGGCACTAAT ATAAGACAGG CCTCAAACTC 100 

4 0 GAGAGCATTT ACTTCTCGCC CGTCAGATCG GTATCCAAAA AATCGTCGTC 150 

TTCGTGAACA AGGTTGATGC CATCGAGGAC AAAGAGATGT TGGAGCTTGT 200 

TGAATTGGAG ATGCGTGAAC TCCTAACCAG CTACGGTTTC GAGGGTGAAG 250 

AAACTCCCAT CATTTTTGGC TCTGCTCTCT GTGCCCTCGA AGGAAGACAA 3 00 

CCCGAGATCG GTGTTACCAA GATTGATGAG CTCTTGCAGG CCGTCGACAC 3 50 

4 5 CTGGATTCCC ACTCCTCAGC GTGAGACTGA CAAGCCCTTC TTGATGTCCA 400 

TTGAGGAAGT GTTCTCTATT TCCGGACGAG GAACCGTTGT CTCCGGCCGT 4 50 

GTGGAGCGTG GTATCCTCAA GAAGGACTCC GAAGTTGAAA TTGTCGGCGG 500 

TTCGCCCGAG CCAATCAAAA CCAAGGTTAC CGATATCGAA ACCTTTAAGA 5 50 

AGTCTTGCGA CGAGTCTCGC GCTGGTGATA ACTCCGGCTT GCTCCTACGA 600 

50 GGCGTTAAGC GTGAAGATAT TAGCCGTGGC ATGGTCGTCG CTGTACCAGG 650 

AAGTGTCAAG GCCCATACTG AATTCTTAGT TTCGCTTTAC GTCCTCACCG 700 

AAGCTGAGGG TGGGCGCAAA TCTGGATTCA GCAGCAAGTA CCGCCCACAG 750 

ATGTTCATTC GCACTGCCGG TATGTAATAC TGTGATAATT TCGTTGACAT 800 

GGTACTGATT GAATTCTATA GACGAAGCGG CTCAGCTCAG CTGGCCCGGA 8 50 

55 GAAGATCAAG ACAAGATGGC TATGCCAGGA GACAATATCG AAATGATTTG 900 

CACCACCTTG CAC CCAGTTG CCGCCGAGGC TGGCCAGCGA TTCA 944 



60 2 ) INFORMATION FOR SEQ ID NO: 502 

439 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 84 9 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusari urn oxysporum 

(B) STRAIN: WSA-212 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 502 



GCTATCATCG TCGTTGCTGC CTC CGATGGA CAGATGCCCC AGACCCGTGA 50 

ACACTTGCTC CTCGCTCGTC AGGTCGGTGT CCAGCGAATT GTCGTCTTCG 100 

TCAACAAGGT CGATGCCATT GATGAC CCCG AGATGCTTGA GCTCGTCGAG 150 

ATGGAGATGC GCGAGCTTCT TAACACCTAC GGCTTCGAAG GCGACGACAC 2 00 

20 TCCCGTCATC ATGGGCTCTG CTCTTATGTC TCTC CAGAAC CAGCGCCCCG 250 

AGATTGGCAC CGAGAAGATC GATGAGCTCC TTGCTGCCGT CGACGAGTGG 3 00 

ATCCCAACCC CCGAGCGTGA CCTTGACAAG CCCTTCCTTA TGTCCGTCGA 350 

GGATGTCTTC TCCATTGCCG GCCGTGGTAC CGTCGTGTCT GGCCGTGTGG 4 00 

AGCGTGGTAT TCTGAAGCGT GATCAGGAGA TCGAGCTTGT CGGAAAGGGG 450 

25 CAGGAGGTTA TCAAGACCAA GGTTAC CGAC ATTGAGACCT TCAAGAAGTC 500 

TTGTGAGCAG TCCCAGGCTG GTGACAACTC TGGTCTCCTC ATC CGAGGTG 550 

TTCGCCGTGA GGATGTCCGC CGTGGTATGG TTGTCTGCGC TCCTGGCACC 600 

GTCAAGTCTC ATACC CAGTT TCTCGCTTCC CTCTACGTCC TCACCAAGGA 650 

GGAGGGTGGC CGACACACCG GTTTC CAGGA GCACTACCGA CCCCAGCTCT 700 

3 0 ATCTCCGAAC TGCAGATGAG TCCATTGACC .TGACTTTCCC CGAGGGTACT 75 0 

GAGGATGCCT CCAGTAAGAT GGTCATGCCT GGTGACAACA CCGAGATGGT 800 

TGT CAC CAT G GGTCACCCCA ATGCCATCGA GGTTGGTCAG CGATTCAAC 849 



35 
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2) INFORMATION FOR SEQ ID NO: 503 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1064 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulatum 

(B) STRAIN: G186A5 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503 



TGGTGCTATT GTCGTTGTTG 
CGAGGGACTG CTGAGGGTTT 
GCATGATGAT AC TAATATTC 

55 GAGCATTTGC TCCTTGCCCG 
CGTGAACAAA GTCGACGCCC 
AGTTAGAAAT GAGAGAGCTC 
ACACCCATCA TCTTTGGTTC 
TGAGTTGGGA GAAAAGAAAA 

60 GGATCCCCAC ACCACAACGT 



CTGCTGCTGA CGGCCAAATG 
TATGCTTTTT AGGCCCCCTT 
GGAAACGTAT CTATTAGGCC 
ACAGGTCGGT GTCCAAAAGA 
TTGAGGACAA GGAGATGTTG 
TTAAACACCT ACGGCTTCGA 
TGCCCTTTGC GCCATGGAAG 
TTGATGAATT GCTGGAGGCT 
GATACCGAAA AACCTTTCTT 

440 



TAAGACGCCG 50 

GTTTCTGAGA 100 

TCAAACACGT 150 

TCGTCGTTTT 200 

GAGCTTGTCG 2 50 

GGGTGAAGAG 3 00 

GCCGTGAGCC 350 

GTTGATACTT 40 0 

GATGTCCGTT 450 
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GAGGAAGTAT TCTCTATCTC CGGCCGTGGA ACCGTTGCCT CCGGTCGTGT 50 0 

TGAGCGCGGT GTCCTCAAGA AGGATTCAGA AGTCGAGCTA ATTGGGGGCG 550 

GCTCCACCCC CATCAGGACG AAGGTAACTG ATATCGAAAC TTTCAAGAAA 600 

TCCTGTGACG AGT CTAGAGC TGGGGACAAC TCCGGTCTTT TATTGCGTGG 6 50 

5 TATCAAGCGT GAAGATATCC GCCGTGGTAT GGTAGTTGCC GTTCCTGGCA 700 

GCGTCAAGGC CCACGACAAG TTCTTGGTGT CGATGTATGT CCTGACCGAA 7 50 

GCTGAGGGTG GTCGCCGAAC CGGATTCGGC CAGAACTATC GTCCTCAAAT 800 

GTTCATCCGC ACAGCTGGTA TGTCAAAATG GGACCCCTTT TCATAATCCT 85 0 

TTCTTTTTTT CCTTTTCCTC TCTATCTCTC TTTCTGTTTC CTTTCAACTC 90 0 

10 GCCTGATTCA CGAAATTAAC TAACCCGTTT GATTATAGAC GAAGCCGCCC 950 

ATCTCAGCTT CCCTAGTGGA GCAGATGAAA GCAAACTCGT TATGCCTGGT 10 00 

GACAACGTCG AGATGATC C T CCAGACACAC CGCCCCGTGG CTGCTGAGGC 1050 

CGGCCAGCGA TTCA 1064 



15 
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55 



2) INFORMATION FOR SEQ ID NO: 504 



(i ) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 982 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
■(D) TOPOLOGY: Linear 

2 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 

(B) STRAIN: ATCC 32071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504 



TGGTGCTATC GTCGTTGTTG CTGCCTCTGA CGGCCAAATG TAGGGATTTT 50 

GCAAGACTGG TGAAAAAATC TAAAGAAAAT AGAAAAGATT GTGCTGATGT 100 

3 5 TTGGTATCAG GCCCCAAACA CGCGAGCATC TGCTTCTTGC CAGACAAGTC 15 0 

GGTGTTCAGA AAATCGTTGT TTTCGTCAAC AAGGTCGATG CTGTAGAGGA 2 00 

TAAGGAGATG TTGGAGCTTG TCGAATTGGA GATGAGAGAG CTCTTGACCA 2 50 

CCTATGGGTT CGAGGGTGAG AAGACAC CTA TCATCTTTGG TTCTGCGCTC 3 00 

TGTGCTATGG AGGGCCGTCA GCCCGAGTTG GGAGAGCAGA AAATTGATGA 3 50 

4 0 ATTACTCGAG GCTGTGGATA CTTGGATCCC TACGCCACAG CGTGATACTG 4 00 

ACAAGCCCTT CCTGATGTCC ATTGAGGAGG TGTTCTCTAT CTCTGGACGA 45 0 

GGAACCGTTG CCTCCGGCCG CGTTGAGCGT GGTATCCTCA AGAAGGACTC 50 0 

CGAAGTTGAA ATTATTGGCG GCGGTGTTCC CACAATCCTG AC CAAGGTGA 55 0 

CTGATATCGA AACCTTCAAG AAGTCTTGCG ACGAGTCCAG AGCCGGGGAC 6 00 

45 AACTCCGGCC TCTTGTTGCG CGGTGTCAAG CGTGAGGATA TCCGCCGTGG 650 

TATGGTCGTT GCAGTTCCCG GAAGCGTCAA AGCACATGAC AGATTCTTGG 700 

TGTCGATGTA CGTTCTGACC GAGGCTGAGG GTGGTCGCCG CACTGGCTTC 750 

GGTCAGAACT ATCGTCCTCA AATGTTCATC CGCACAGCTG GTACGTTCAT 8 00 

TCTTTCACTA TATTCCTATA TGCATAGCCC GATCCTCCCA TTAAC TAATT 850 

50 GACACAGACG AGGCTGCTGA ACTCAGCTGG CCTGATGGAG ACGACGAAGC 900 

CAAAATGGTC ATGCCCGGTG ACAATGTTGA AATGGTCCTG AAGTCACACC 950 

GCCCGGGGCC GCTGAGGCTG GACAGCGATT CA 982 



2 ) INFORMATION FOR SEQ ID NO: 505 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 931 bases 
60 (B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Panic i Ilium marneffei 
<B) STRAIN: ATCC 58950 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 505 

CGCTGTTGTT GTCGTCGCTG CTTCTGATGG TCAAATGTAA CATATCCACG 50 

AGCTGCCAAT TATGGACACT GCTGATAAGA AT AGGCC CCA AACCCGTGAG 100 

CACTTGCTCC TCGCCCGTCA GGTCGGTGTT CAAAAGATCG TCGTCTTCGT 150 

15 CAACAAGGTT GATGCCGTCG AGGACCCCGA GATGTTGGAA CTTGTCGAAT 2 00 

TGGAAATGCG TGAACTCTTG ACCACCTACG GTTTCGAGGG TGAAGAGACC 250 

CCTATCATTT TCGGATCCGC TCTTTGCGCC TTGGAAGGCC GCAAGCCCGA 3 00 

GATTGGCGAA CAGAAGATTG ACGAGCTCAT GAACGCCGTT GATACCTGGA 3 50 

TCCCCACCCC CCAGCGTGAC CTTGACAAGC CCTTCTTGAT GTCCGTTGAG 400 

2 0 GAAGTTTTCT CCATCTCTGG TCGTGGTACC GTTGCATCTG GTCGTGTTGA 450 

GCGTGGTATT TTGCGCAAGG ATTCTGAGGT TGAGATTATC GGATACCAGA 50 0 

AGAACCCTAT CAAGACCAAG GTTACCGACA TTGAGACCTT CAAGAAGTCT 550 

TGCGATGAAT CTCGTGCTGG TGACAACTCT GGCTTGCTTC TCCGTGGTAT 60 0 

CAAGCGTGAG GACATTCGTC GTGGTATGGT TATCGCTGCT CCTGGAACCA 650 

25 CCAAGGCTCA TGACAACTTC TTGGTCTCCA TGTATGTCTT GACTGAGGCT 70 0 

GAAGGTGGTC GTCGTACTGG ATTCGGCGCC AACTACCGTC CTCAAGCTTT 75 0 

CATCCGTACT GC CGGTATGT TCCCTTTCAA AGT CAATTAA TGAGCGATTT 80 0 

GCTAACGAGT TATAGATGAG GCTGCTACTC TCAGCTTCCC CGGTGACGAT 8 50 

CAGTCCAAGC AGGTCATGCC CGGTGACAAC GTTGAGATGA TCTTGAAGAC 90 0 

3 0 ACACCGTCCC GTTGCCGCCG AAGCTGGTCA G 931 



2) INFORMATION FOR SEQ ID NO: 506 



35 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
4 0 <D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 18205 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506 

50 TGGTGCTATT ATTGTTGTTG CTGCTTCTGA TGGTCAAATG CCTCAAACCA 50 

GAGAACATTT RTTATTGGCT AGACAAGTTG GTGTTCAACA CATTGTTGTC 100 

TTTGTTAACA AAGTTGATAC TATTGATGAC CCAGAAATGT TGGAATTAGT 150 

TGAAATGGAA ATGAGAGAAT TGTTAAGTAC TTATGGTTTT GATGGTGATA 2 00 

ACGTCCCAGT TGTTATGGGT TCTGCTTTAT GTGCCTTGGA AGGTCGTGAA 250 

55 GAAGAAATTG GTGT CAAAGC TATTGATAAA TTATTAGCTG CTGTTGATGA 3 00 

ATATATCCCA ACCCCACAAA GAGATTTAGA AAAAC CATTC TTGATGGGTG 3 50 

TTGAAGATGT CTTYTCAATC TCAGGTAGAG GTAC CGTTGT TACTGGTCGT 4 00 

GTTGAACGTG GTAACTTGAA GAAAGGTGAT GAAGTTGAAA TTGTTGGTTT 4 50 

AAACAAAACT CCATTGAAAA CTACTGTYAC NGGTATTGAA ATGTT CAAAA 500 

6 0 AAGAATTGGA CCAAGC TATG GCTGGTGATA ACTGTGGTAT CTTATTACGT 550 

442 
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GGTATCAAAA GAGATGACAT YAAAAGAGGT 
TACCATCTCW GCTCACACTA AATTCTTAGC 
AAGAAGAAGG TGGTCGTCAC TCAGGTTTTG 
TTATTCATCA GAACTGGTGA TGTTACCGTT 
5 TGGTGATTCA TCTCAACAAA TCTTACCAGG 
GTGAATTGGT TCACCCAACT GCTTTAGAAG 



ATGGTTATTG CTAAAACCGG 600 

CTCAATGTAT ATTTTGACTA 6 50 

GTGAACATTA CAGACCTCAA 700 

GTTTTAACCT T Y C CAGAAGG 7 50 

TGACAATGTC GAAATGGTTT 800 

CTGGTCAAAG ATTCAA 84 6 



10 2) INFORMATION FOR SEQ ID NO: 507 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 964 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: WSA-22 5 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507 



GGAGCTGTTG TCGTTGTCGC AGCTTCTGAC GGTCAAATGT AATTGAATGC 50 

CCGCCCAGAC GGATGAAAGG ATTTGACGTT TCTAACATCA GTCTAGGCCT 100 

CAGACCAGAG AACATTTGCT CCTTGCCCGC CAGGTCGGTG TCCAGAAGCT 150 

GGTCGTTTTC GTTAACAAGG TCGATGCCGT TGAGGACC CA GAGATGTTGG 2 00 

3 0 AGCTTGTCGA ACTTGAAATG CGTGAACTCC TCAGCCACTA CAGTTTTGAG 250 

GGTGAGGAGA CCCCCATCAT TTTTGGCTCT GCTCTCTGTG CCCTCGAGTC 300 

CCGTCGACCT GAGCTTGGTG TCGAGAAGAT TGACGAGCTA TTGAACGCCG 350 

TCGACACCTG GATCCCCACC CCCGAGCGCG CCACTGATAA GCCTTTCCTC 400 

ATGTCCATTG AGGAAGTGTT CTCTATCTCT GGTCGTGGTA CCGTCGTCTC 450 

35 CGGTCGTGTT GAGCGTGGTA TCCTCAAGAA GGATTCCGAC GTCGAAATTG 50 0 

TTGGTGGCTC TACCACCCCT AT CAAGAC C A AGGTCACAGA TATCGAAACC 550 

TTCAAGAAGT CCTGCGATGA ATCTCGAGCT GGTGACAACT CTGGTCTCCT 600 

TCTCCGAGGT AT C AAGCGTG AGGACTTGAA GCGTGGAATG GTTGTTGCTG 650 

CCCCCGGATC CACCAAGGCT CACACCGACT TCATGGTCTC CCTCTACGTC 700 

40 CTGACTGAGG CTGAGGGTGG TCGTTCCAAC GGCTTCACCC ACAAGTACCG 750 

CCCTCAAATG TTCATCCGTA CTGCTGGTAT GTAACCAAAG TTTCCGCTAT 800 

TTACTAAGTA GATCATTGCT AACTTGTATT CCCTTCCGTA GACGAAGCCG 850 

CATCTTTCAG CTGGCCTGGA GAAGACCAAG ACAAGAAGGC TATGCCTGGT 900 

GACAACGTCG AGATGATTTG CAAAACCCTC CACCCCATTG CTGCCGAGGC 950 

45 TGGCCAACGA TTCA 964 



2) INFORMATION FOR SEQ ID NO: 508 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Yarrowia lipolytica 
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(B) STRAIN: ATCC 38295 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 508 

5 GGTGCTATCA TTGTCGTTGC TGCTGGAGAC GGTTCCAAGC CCCAGACCCG 50 

AGAGCATCTG CTGCTTGCTC GACAGGTCGG TGTCCAGAAC CTGGTTGTGT 100 

TTGTTAACAA GGTTGATCAG ATTGATGATA AGGAGATTCT TGAGCTCGTT 150 

GACATGGAGA TGCGAGATCT GCTGACCCAG TACGGTTTTG ATGGTGACAA 2 00 

CACCCCCGTT GTCATGGGCT CTGCTCTGTG CGCTCTTGAG GGCAAGCAGA 250 

10 AGGATATTGG AGAGGACGCC ATCATGGCCC TTATGGATGC CGTTGATGAG 3 00 

CACATCCCTA CCCCTAACCG TGACCTTGAG AAGCCCTTCC TGATGCCCGT 350 

TGAGGACGTT TTCTCCATCT CTGGCCGAGG AACTGTTGTT ACTGGCCGAG 4 00 

TCGAGCGAGG AAACCTGAAG AAGGGTGAGG AAATCGAGAT TGTTGGCTAC 45 0 

AACAACAAGC CCATCAAGGC TGTTGTTACC GGTATTGAGA TGTTCAAGAA 500 

15 GGAGCTCGAG TCCGCCATGG CCGGTGACAA CGCCGGTATC CTGCTCCGAG 550 

GTATCAAGCG AGACGAGATC AAG CGAGGTA TGGTCATGTG CAAGCCTGGC 6 00 

AC CGTCAACG CCCACACCAA GTTCCTTGCT TCTCTTTACA TCATCCCCAC 650 

CGAGGAGGGT GGTCGAACCA GCTCTTTCGG CGCCAACTAC CGACCCCAGA 70 0 

TGTTCATCCG AACTTCTTCC GTCACCGCCA CTCTCACCTT CCCCGAGGGT 750 

2 0 ACCGACGAGT CCCAGACCGT CAACCCCGGT GACAACACTG AGATGGTTCT 80 0 

CGAGC TTGTT CACCCTACCG CCATTGAGGT CAACCAGCGA TTCA 844 



2 5 2) INFORMATION FOR SEQ ID NO : 509 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 1067 bases 

(B) TYPE: Nucleic acid 

3 0 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bigemina 

(B) STRAIN: Suarez-2 



40 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 509 



CTTGGACAAG CTGAAGAGCG AGCGTGAGCG TGGTATCACC ATTGACATTA 5 0 

CCCTGTGGAA GTTCGAAACT GGCAAGTACT ACTACACCGT CATTGACGCC 100 

CCCGGTCACC GTGACTTCAT TAAGAACATG ATTACGGGTA CCTCCCAGGC 150 

CGATGTTGCT ATGCTTGTCG TGCCCGCCGA GGCTGGTGGT TTCGAAGCTG 2 00 

45 CCTTCTCTAA GGAAGGTCAG ACCCGTGAGC ACGCTCTTTT GGCCTTCACC 2 50 

CTTGGTGTCA AGCAGATCAT TTGCGCCATC AACAAGATGG ACAAGTGCGA 3 00 

CTACAAGGAG GACCGTTACA GCGAAATC CA GAAGGAAGTT CAGGGTTACC 3 50 

TGAAGAAGGT CGGTTACAAC ATCGAGAAGG TGCCTTTCGT CGCCATCTCC 400 

GGTTTCATGG GTGACAACAT GGTTGAGCGC TCCACCAACA TGCCGTGGTA 4 50 

50 CAAGGGCAAG ACCTTGGTCG AGGCCCTCGA CATGATGGAG CCCCCGAAGA 50 0 

GGCCCGTCGA CAAGCCCCTG CGTCTTCCCC TCCAGGGTGT GTACAAGATC 55 0 

GGTGGTATCG GTACCGTCCC TGTCGGTCGT GTGGAGACTG GTCAGCTCAA 60 0 

GGC CGGTATG GTCCTCACCT TCGCCCCCAA CCCGATCACT ACTGAGTGCA 65 0 

AATCCGTCGA AATGCACCAC GAAGTTATCG ATGTTGCCAG CCCTGGTGAC 70 0 

55 AACGTTGGTT TCAACGTGAA GAACGTGTCC ACCTCTGACA TCCGCACTGG 750 

TCACGTCGCT TCTGACTCCA AGAACGACCC CGCCAAGGCC GCCGTGTCCT 800 

TCACCGCCCA GGTCATCATC TTGAACCACC CTGGTAC CAT CAAGGCCGGT 85 0 

TACTCCCCTG TGGTTGACTG CCACACTGCC CACATCTCGT GCAAATTCGA 90 0 

CGAGATCACC AGCCGTATGG ACAAGCGTAC CGGTAAGGCC CTTGAGGAGA 95 0 

60 ACCCCAAGAC CATCAAGAAC GGCGACGCCG CTATGGTCGT CCTGAAGCCG 1000 
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45 
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TGCAAGCCCA TGGTCGTCGA GGCCTTCACT GAATACGCTC CCCTTGGTCG 10 50 

TTTCGCCGTG CGTGACG 10 67 



2) INFORMATION FOR SEQ ID NO: 510 



( i ) SEQUENCE CHARACTERI STI CS : 
(A) LENGTH: 104 9 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bovis 

(B) STRAIN: Suarez-3 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 510 

GTGAACGTGA ACGTGGTATT ACTATTGATA TTAC CTTATG GAAGTTCGAG 5 0 

ACCACCAAGT ACTACTACAC CGTCATTGAT GCCCCTGGTC ACCGTGACTT 100 

CATCAAGAAC ATGATTACGG GTACTTCTCA AGCCGATGTT GCTATGCTTG 150 

25 TTGTACCAGC TGAGGCTGGT GGTTTCGAGG CCGCTTTCTC CAAGGAAGGA 2 00 

CAGACCCGTG AGCACGCTCT TTTGGCTTTC ACCCTTGGTG TCAAACAGAT 250 

CATCTGTGCC ATTAACAAGA TGGACAAGTG CGACTACAAG GAGGACCGTT 3 00 

ACAGTGAAAT CCAGAAGGAA GTC CAGGGTT ACCTCAAGAA GGTCGGTTAC 350 

AATATTGAGA AGGTGCCCTT CGTTGCCATC TCCGGTTTCA TGGGAGACAA 400 

3 0 CATGGTTGAG CGTTCCACCA ACATGCCCTG GTATAAGGGA AAGACATTGG 4 50 

TCGAGGCCCT TGAT CAGATG GAACCCCCAA AGAGGCCCGT TGACAAGCCA 500 

CTTCGTCTTC CCCTCCAGGG TGTCTACAAG ATCGGTGGTA TCGGTACCGT 55 0 

CCCCGTCGGT CGTGTTGAAA CTGGTATGTT GAAGGCTGGT ATGATT CTAA 600 

CCTTTGCTCC TAACC CAATC AC CACTGAAT GCAAATCCGT TGAAATGCAC 650 

35 CACGAAACCG TTGAGGTTGC TTACCCCGGT GACAACGTCG GTTTCAACGT 700 

AAAGAACGTT TCTACTTCTG ACATTCGCAG TGGTCACGTT GCCTCTGATT 75 0 

CTAAGAACGA CCCTGCCAAG GCTGCTGTTT CCTTCACTGC CCAGGTCATT 800 

GTGCTCAACC ACCCTGGTAC CATTAAGGCC GGTTACTGCC CCGTCGTCGA 850 

TTGCCACACC GCTCACATTT CATGTAAATT CGAAGAGATC ACCAGCCGTA 90 0 

4 0 TGGACAAGCG TACCGGTAAA TCTCTTGAGG AAAACCCCAA GACCATCAAG 950 

AACGGTGACG C TGC CATGGT TGTGCTCAAG CCAATGAAGC CCATGGTTGT 1000 

CGAATCCTTC ACTGAGTATG CTCCTCTTGG TCGTTTCGCT GTTCGTGAC 1049 



2) INFORMATION FOR SEQ ID NO: 511 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1070 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Crithldia fasciculata 

(B) STRAIN: ATCC 11745 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 511 

445 
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TGGACAAGCT GAAGGCGGAG CGCGAGCGCG GTATCACGAT CGATATTGCC 50 

CTGTGGAAGT TCGAGTCGCC CAAGTCCGTG TTCACGATCA TCGATGCCCC 100 

CGGCCACCGC GACTTCATCA AGAACATGAT CACCGGCACC TCCCAGGCCG 150 

5 ATGCCGCCAT TCTGATGATT GACTCGACCC AGGGTGGCTT CGAGGCTGGC 200 

AT CTC CAAGG ACGGCCAGAC CCGCGAGCAC GCCCTGCTTG CCTTCACGCT 250 

GGGCGTGAAG CAGATGGTTG TGTGCTGCAA CAAGATGGAC GACAAGACGG 3 00 

TGCAGTACGC CCAGGCCCGC TACGAGGAGA TCAGCAAGGA GGTCGGCGCG 35 0 

TACCTGAAGC GCGTGGGCTA CAACCCGGAG AAGGTGCGCT TCATCCCGAT 4 00 

10 CTCGGGCTGG CAGGGCGACA ACATGATCGA GAAGTCCGAC AACATGTCGT 450 

GGTACAAGGG TCCCACGCTG CTGGAGGCGC TCGACCTGCT GGAGGCCCCC 50 0 

GTGCGTCCGG TGGACAAGCC GCTGCGCCTG CCCCTGCAGG ACGTGTACAA 550 

GATCGGCGGT ATCGGCACTG TGCCCGTGGG CCGTGTGGAG ACCGGCGTGA 600 

TGAAGCCGGG CGACGTTGTG GTGTTCGCGC CTGCCAACGT GACGACCGAG 650 

15 GTGAAGTCGA TCGAGATGCA CCACGAGCAG CTGGCTGAGG CCGTGCCCGG 7 00 

CGACAACGTG GGCTTCAACG TGAAGAACGT GTCCGTGAAG GATATCCGCC 750 

GTGGTAACGT GTGCGGCAAC ACGAAGAGCG ACCCCCCGAA GGAGGCGGCC 800 

GACTTCACTG CCCAGGTGAT CGTGCTGAAC CACCCCGGCC AGAT CAGCAA 850 

CGGCTACGCG CCGGTGCTGG ACTGCCACAC GAGCCACATC GCGTGCAAGT 900 

2 0 TCGCGGACAT CGAGTC CAAG ATCGACCGCC GCTCTGGCAA GGAGCTGGAG 950 

AAGAGCCCGA AGGCCATCAA GTCCGGCGAT GCGGCCATCG TGAAGATGAT 1000 

CCCGCAGAAG CCGATGTGCG TGGAGGTGTT CAACGACTAC CCGCCGCTGG 105 0 

GCCGCTTCGC TGTCCGCGAT 107 0 

25 
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2) INFORMATION FOR SEQ ID NO: 512 

(i) SEQUENCE CHARACTERISTICS : 
3 0 (A) LENGTH: 1052 bases 

■'(B) TYPE: Nucleic acid ! * 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

3 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Entamoeba histolytica 

(B) STRAIN: HM1-IMMS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512 

GCTGAAAGAG AAAGAGGAAT TACTATTGAT ATTTCATTAT GGAAATTCGA 50 

AACATCTAAA TACTACT TCA CTATTATTGA TGCCCCAGGT CAYAGAGATT 100 

45 TCATTAAGAA CATGATTACT GGAACTTCAC AAGCTGATGT TGCCATCCTT 150 

ATTGTTGCTG CTGGTACTGG WGAATTTGAA GCTGGTATTT CAAAGAATGG 200 

ACAAACCAGA GAACAYATTC TTCTTTCATA CACTCTTGGA GTTAAACAAA 2 50 

TGATTGTTGG WGTTAACAAG ATGGATG CTA TTCAATATAA ACAAGAAAGA 3 00 

TATGAAGAAA TTAAGAAAGA AATTAGTGCA TTCCTTAAGA AGACWGGATA 350 

50 TAATC CAGAC AAGATTCCAT TTGTCCCAAT TTCAGGATTC CAAGGAGATA 4 00 

ATATGATTGA ACCATCAACC AACATGCCAT GGTACAAAGG ACCAACATTA 4 50 

AT TGGAGC AC TTGATT CAGT CACACCACCA GAAAGACCAG TTGATAAACC 500 

ACTTAGACTT CCACTTCAAG ATGTTTAYAA GATTTCAGGT ATTGGAACTG 550 

TACCATGTGG AAGAGTTGAA ACTGGAGTTC TTAGACCAGG AACTATTGTT 600 

55 CAATTTGCAC CATCAGGAGT TTCATCTGAA TGTAAATCAG TTGAAATGCA 6 50 

TCACACAGCA CTTGCTCAAG CTATTCCAGG TGATAATGTT GGATTCAATG 7 00 

TTAGAAAYTT AACAGTTAAA GATATTAAGA GAGGAAATGT AGCATCAGAT 7 50 

GCTAAGAATC AACCAGCTGT TGGATGTGAA GATTTCACTG CTCAAGTCAT 8 00 

TGTCATGAAC CATCCAGGAC AAATTAGAAA GGGATATACA CCAGTTCTTG 850 

60 ATTGCCATAC ATCACACATT GCATGTAAAT TCGAAGAATT ATTAAGCAAG 9 00 
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ATTGATAGAA GAACAGGTAA ATCCATGGAA 
TAAGAATGGA GATTCAGCAC TTGTTAAGAT 
GTGTTGAAGA ATTTGCTAAA TTCCCACCAT 
GA 



GGAGGAGAAC CAGAATATAT 950 
TGTTCCAACT AAACCACTTT 10 00 

TGGGAAGATT TGCTGTTAGA 1050 

1052 



2) INFORMATION FOR SEQ ID NO: 513 

10 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1082 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



15 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Giardia lamblia 
2 0 (B) STRAIN: Faubert-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 513 

GACGAGCGCG AGCGCGGGAT CACGATCAAC ATCGCGCTCT GGAAGTTCGA 50 

2 5 GACGAAGAAG TACATCGTCA CGATCATCGA CGCCCCGGGC CACCGCGACT 100 

TCATCAAGAA CATGATCACG GGGACGTCCC AGGCCGACGT CGCGATCCTC 150 

GTCGTCGCGG CGGGCCAGGG CGAGTTCGAG GCCGGGATCT CGAAGGACGG 2 00 

CCAGACGCGC GAGCACGCGA CCCTTGCGAA CACGCTCGGG ATCAAGACGA 25 0 

TGATCATCTG CGTCAACAAG ATGGACGACG GCCAGGTCAA GTACTCGAAG 3 00 

3 0 GAGCGCTACG ACGAGATCAA GGGCGAGATG ATGAAGCAGC TCAAGAACAT 3 50 

CGGCTGGAAG AAGG CCGAGG AGTTCGACTA CATCCCGACG TCCGGCTGGA 4 00 

CCGGGGACAA CATCATGGAG AAGTC CGACA AGATGCCCTG GTACGAGGGC 450 

CCGTGCCTGA TCGACGCGAT CGACGGGCTC AAGGCCCCGA AGCGCCCGAC 5 00 

CGACAAGCCC CTCCGCCTCC CGATC CAGGA CGTCTACAAG ATCTCGGGCG 55 0 

3 5 T CGGGAC CGT CCCCGCGGGC CGCGTCGAGA CGGGCGAGCT CGCGCCCGGG 600 

ATGAAGGTCG TCTTCGCCCC GACGTCCCAG GTCTCGGAGG TCAAGTCCGT 650 

CGAGATG CAC CACGAGGAGC TCAAGAAGGC CGGGCCCGGG GACAACGTCG 700 

GCTTCAACGT CCGCGGGCTC GCCGTCAAGG AC CTCAAGAA GGGCTACGTC 750 

GTCGGGGACG TGACGAACGA CCCGCCCGTC GGCTGCAAGA GCTTCACCGC 8 00 

4 0 CCAGGTCATC GTCATGAACC AC CCGAAGAA GATCCAGCCC GGCTACACGC 850 

CCGTCATCGA CTGCCACACC GCGCACATCG CGTGCCAGTT CCAGCTCTTC 900 

CTCCAGAAGC TCGACAAGCG CACGCTCAAG CCCGAGATGG AGAACCCGCC 950 

CGACGCAGGC CGCGGCGATT GCAT CATCGT CAAGATGGTC CCCCAGAAGC 1000 

CCCTGTGCTG CGAGACGTTC AACGACTACG CGCCCCTCGG CCGCTTCGCC 1050 

45 GTCCG CGACA TGCGCCAAAC CGTTGCCGTC GG 1082 



2) INFORMATION FOR SEQ ID NO: 514 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Leishmania tropica 
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(B) STRAIN: ATCC 30816 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514 

5 ACGCGTGGGT GCTCGACAAG CTGAAGGCGG AGCGCGAGCG CGGCATCACG 5 0 

ATCGACATTG CGCTGTGGAA GTTCGAGTCG CCCAAGTCCG TGTT CACGAT 100 

CATCGATGCG CCCGGCCACC GCGACTTCAT CAAGAACATG ATCACGGGCA 15 0 

CGTCGCAGGC GGACGCCGCC ATCCTGATGA TCGACTCGAC GCATGGTGGC 200 

TTCGAGGCTG GCATCTCGAA GGACGGCCAG ACCCGCGAGC ACGCGCTGCT 2 50 

10 TGCCTTCACT CTTGGCGTGA AG CAGATGGT GGTGTGCTGC AACAAGATGG 3 00 

ACGACAAGAC GGTGACGTAC GCGCAGTCGC GCTACGATGA GATCAGCAAG 350 

GAGGTGGGCG CGTACCTGAA GCGCGTGGGC TACAACCCGG AGAAGGTGCG 40 0 

CTTCATCCCG ATCTCGGGCT GGCAGGGCGA CAACATGATC GAGAAGTCGG 450 

ACAACATGCC GTGGTACAAG GGTCCCACGC TGCTGGACGC GCTCGACATG 5 00 

15 CTGGAGC CGC CGGTGCGCCC GGTGGACAAG CCGCTGCGCC TGCCCCTGCA 550 

GGACGTGTAC AAGATCGGCG GTATCGGGAC GGTGCCCGTG GGGCGCGTGG 60 0 

AGACCGGCAT CATGAAGCCG GGCGACGTGG TGACGTTCGC GCCCGCCAAC 650 

GTGACGACTG AGGTGAAGTC GATCGAGATG CACCACGAGC AGCTGGCGGA 70 0 

GGCGCAGCCC GGCGACAACG TCGGCTTCAA CGTGAAGAAC GTGTCGGTGA 750 

20 AGGACATCCG CCGTGGTAAC GTGTGCGGCA ACTCGAAGAA CGACCCGCCG 80 0 

AAGGAGGCGG CCGACTTCAC GGCGCAGGTG ATCGTGCTGA ACCACCCCGG 850 

CCAGATCAGC AACGGCTACG CGCCGGTGCT GGACTGCCAC ACGAGC CACA 900 

TTGCGTGCCG CTTCGCGGAA ATCGAGTCCA AGATCGACCG CCGCTCCGGC 950 

AAGGAGC TGG AGAAGAACCC CAAGGCGATC AAGT CTGGCG ATGCCGCGAT 1000 

2 5 CGTGAAGATG GTGCCGCAGA AGCCGATGTG CGTGGAGGTG TTCAACGACT 1050 

ACGCGCCGCT GGGCCGCTTT GCCGTGCGCG ACATGCGCCA AACCGTTG 1098 



30 2) INFORMATION FOR SEQ ID NO: 515 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1104 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE : Genomi c DNA 

4 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lelshmania aethiopica 

(B) STRAIN: ATCC 50119 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515 



TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 5 0 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

ACGTCGCAGG CGGACGCCGC CATC CTGATG ATCGACTCGA CGCATGGTGG 200 

50 CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 2 50 

TTGCCTTCAC TCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 3 00 

GACGACAAGA CGGTGACGTA CGCGCAGTCG CGCTACGAGG AGATCAGCAA 3 50 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 4 00 

GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAAGT CG 4 50 

55 GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGCCGCGTG 600 

GAGACCGGCA TCATGAAGCC GGGC GACGTG GTGACGTTCG CGCCCGCCAA 650 

CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

60 AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 
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AAGGACATCC 
GAAGGAGGCG 
GCCAGATCAG 
ATTGCGTGCC 
5 CAAGGAGCTG 
TCGTGAAGAT 
TACGCGCCGC 
CGTC 



GCCGTGGCAA 
GCCGACTTCA 
CAACGGCTAC 
GCTTCGCGGA 
GAGAAGAACC 
GGTGCCGCAG 
TGGGCCGCTT 



CGTGTGCGGC 
CGGCGCAGGT 
GCGCCGGTGC 
AATCGAGTCC 
CCAAGGCGAT 
AAGCCGATGT 
TGCCGTGCGC 



AACTCGAAGA 
GATCGTGCTG 
TGGACTGCCA 
AAGATCG AC C 
CAAGTCTGGC 
GCGTGGAGGT 
GACATGCGCC 



ACGACCCGCC 8 00 

AACCACCCCG 85 0 

CACGAGCCAC 900 

GCCGCTCCGG 950 

GATGCCGCGA 1000 

GTTCAACGAC 1050 

AAACCGTTGC 110 0 
1104 



10 



15 



20 



25 



2) INFORMATION FOR SEQ ID NO: 516 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1106 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 30815 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 516 



TACGCGTGGG 
GATCGACATT 

3 0 TCATCGATGC 

ACGTCGCAGG 
CTTCGAGGCT 
TTGCCTTCAC 
GACGACAAGA 
35 GGAGGTGGGC 
GCTTCATCCC 
GACAACATGC 
GCTGGAGCCG 
AGGACGTGTA 

4 0 GAGACCGGCA 

CGTGACGACT 
AGGCGCAGCC 
AAGGACATCC 
GAAGGAGGCG 

4 5 GCCAGATCAG 
ATTGCGTGCC 
CAAGGAGCTG 
TCGTGAAGAT 
TACGCGCCGC 

50 CGTCGG 



TGCTCGACAA 
GCGCTGTGGA 
GCCCGGCCAC 
CGGACGCCGC 
GGCATCTCGA 
KCTTGGCGTG 
CGGTGACGTA 
GCGTACCTGA 
GATCTCGGGC 
CGTGGTACAA 
CCGGTGCGCC 
CAAGATC GGC 
TCATGAAGCC 
GAGGTGAAGT 
CGGCGACAAC 
GCCGTGGTAA 
GCCGACTTCA 
CAACGGCTAC 
GCTTCGCGGA 
GAGAAGAACC 
GGTGCCGCAG 
TGGGCCGCTT 



GCTGAAGGCG 
AGTTCGAGTC 
CGCGACTTCA 
CATC C TGATG 
AGGACGGCCA 
AAGCAGATGG 
CGCGCAGTCG 
AGCGCGTGGG 
TGGCAGGGCG 
GGGTCCCACG 
CGGTGGACAA 
GGTATCGGGA 
GGGCGACGTG 
CGATCGAGAT 
GTCGGCTT C A 
CGTGTGCGGC 
CGGCGCAGGT 
GCGCCGGTGC 
AATCGAGTCC 
CCAAGGCGAT 
AAGCCGATGT 
TGCCGTGCGC 



GAGCGCGAGC 
GCCCAAGTCC 
TCAAGAACAT 
ATCGACTCGA 
GACCCGCGAG 
TGGTGTGCTG 
CGCTACGATG 
CTACAACCCG 
ACAACATGAT 
CTGCTGGACG 
GCCGCTGCGC 
CGGTGCCCGT 
GTGACGTTCG 
GCACCACGAG 
ACGTGAAGAA 
AACTCGAAGA 
GATCGTGCTG 
TGGACTGCCA 
AAGATCGACC 
CAAGTCTGGC 
GCGTGGAGGT 
GACATGCGCC 



GCGGCATCAC 50 

GTGTTCACGA 100 

GATCACGGGC 150 

CGCATGGTGG 200 

CACGCGCTGC 25 0 

CAACAAGATG 3 00 

AGATCAGCAA 350 

GAGAAGGTGC 400 

CGAGAAGTCG 4 50 

CGCTCGACAT 50 0 

CTGCCCCTGC 550 

GGGGCGCGTG 600 

CGCCCGCCAA 650 

CAGCTGGCGG 700 

CGTGTCGGTG 750 

ACGACCCGCC 80 0 

AACCACCCCG 850 

CACGAGCCAC 900 

GCCGCTCCGG 95 0 

GATGCCGCGA 1000 

GTTCAACGAC 1050 

AAACCGTTGC 1100 
1106 



2) INFORMATION FOR SEQ ID NO: 517 

55 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1099 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 



449 



CA 02283458 1999-09-28 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Leishmania donovani subsp . donovani 

(B) STRAIN: ATCC 50212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517 



10 TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 5 0 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA T CAAGAAC AT GATCACGGGC 150 

ACGTCGCAGG CGGACGCCGC CATCCTGATG ATCGACTCGA CGCATGGTGG 200 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

15 TTGCCTTCAC GCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 3 00 

GACGACAAGA CCGTGACGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 350 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400 

GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAGGTCG 450 

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

2 0 GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 55 0 

AGGACGTGTA CAAGAT CGGC GGTATCGGGA CTGTGCCCGT GGGCCGCGTG 600 

GAGAC CGGCA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

CGTGACGACT GAGGTGAAGT CGATCGAGAT GCAC CACGAG CAGCTGGCGG 700 

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 

25 AAGGACATCC GCCGTGGCAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850 

GCCAGAT CAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

ATTGCGTGCC GCTTCGCGGA AATCGAGTCC AAGATCGACC GCCGCTCCGG 95 0 

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GATGCCGCGA 100 0 

30 TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTTG 1099 



35 2) INFORMATION FOR SEQ ID NO: 518 



(i) SEQUENCE CHARACTERISTICS-: 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania donovani subsp. infantum 

(B) STRAIN: MOU 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518 

50 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

ACGTCGCAGG CGGACGCCGC CATCCTGATG ATCGACTCGA CGCATGGTGG 2 00 

55 CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 2 50 

TTGCCTTCAC GCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 3 00 

GACGACAAGA CCGTGACGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 3 50 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 4 00 

GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAGGTCG 4 50 

60 GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 
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GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

AGGACGTGTA CAAGATCGGC GGTAT CGGGA CTGTGCCCGT GGGCCGCGTG 600 

GAG AC CGG C A TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

CGTGACGACT GAGGTGAAGT C GAT CGAGAT GCACCACGAG CAGCTGGCGG 700 

5 AGGCGCAGCC CGG CGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 75 0 

AAGGACATCC GCCGTGGCAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 8 50 

G C C AGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 90 0 

ATTGCGTGCC GCTTCGCGGA AATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

10 CAAGGAGCTG GAGAAGAAC C CCAAGGCGAT CAAGTCTGGC GATGCCGCGA 100 0 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTT 10 98 



15 



25 



55 



2) INFORMATION FOR SEQ ID NO : 519 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1071 bases 
20 (B) TYPE: Nucleic acid 

( C ) STRANDEDNES S : Doubl e 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lelshmania enriettii 

(B) STRAIN: ATCC 50120 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519 

CTCGACAAGC TGAAGGCGGA GCGCGAGCGC GGCATCACGA TCGACATTGC 50 

GCTGTGGAAG TTCGAGTCGC CCAAGTCTGT GTTCACGATC ATCGATGCGC 100 

CCGGCCACCG CGACTTCATC AAGAACATGA TCACCGGCAC GTCGCAGGCC 15 0 

3 5 GACGCCGCCA TCCTGATGAT CGACTCGACC CAGGGCGGCT TCGAGGCTGG 2 00 

CATCTCGAAG GACGGCCAGA CCCGCGAGCA CGCGCTGCTC GCCTTCACGC 25 0 

TTGGTGTGAA GCAGATGGTG GTGTGCTGCA ACAAGATGGA CGACAAGACG 300 

GTGCAGTACT CGCAGGCGCG CTACGAGGAG AT CAGCAAGG AGGTGGGCGC 3 50 

GTACCTGAAG CGCGTCGGCT ACAACCCGGA GAAGGTGCGC TTCATCCCGA 4 00 

4 0 TCTCGGGCTG GCAGGGCGAC AACATGATCG ACAAGTCGGA CAGCATGCCG 4 50 

TGGTACAAGG GGCCCACGCT GCTGGACGCG CTCGACATGC TGGAGGCGCC 500 

GGTGCGCCCG GTGGACAAGC CGCTGCGCCT GCCCCTGCAG GACGTGTACA 550 

AGATCGGCGG TATCGGGACG GTGCCCGTGG GCCGCGTGGA GACCGGCATC 600 

ATGAAGCCTG GCGACGTGGT GACGTTTGCG CCCGCCAACG TGACGACGGA 650 

45 GGTGAAGTCG ATCGAGATGC AC CACGAGCA GCTGGCGGAG GCGGTGCCCG 700 

GCGACAACGT CGGCTTCAAC GTGAAGAACG TGTCGGTGAA GGACATCCGC 750 

CGTGGTAACG TGTGCGGCAA CTCGAAGAAC GACCCGCCGA AGGAGGCGGC 800 

CGACTTCACG GCGCAGGTGA TCGTGCTGAA CCACCCTGGC CAGATCAGCA 850 

ACGGCTACGC GCCGGTGCTG GACTGCCACA CGAGCCACAT CGCGTGCCGC 900 

50 TTCGCGGATA TCGAGTCCAA GATCGACCGC CGCTCTGGCA AGGAGCTGGA 950 

GAAGAACCCC AAGGCGATCA AGTCCGGCGA TGCGGCCATC GTGAAGATGG 1000 

TGC CGCAGAA GCCGATGTGC GTGGAGGTGT TCAACGACTA CCCGCCGCTG 1050 

GGGCGCTTCG CTGTGCGCGA C 1071 



2) INFORMATION FOR SEQ ID NO: 52 0 



( i ) SEQUENCE CHARACTE R I STI CS : 
60 (A) LENGTH: 1071 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmanla gerbilli 

(B) STRAIN: ATCC 50121 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 520 

CTCGACAAGC TGAAGGCGGA GCGCGAGCGC GGCATCACGA TCGACATTGC 5 0 

GCTGTGGAAG TTCGAGTCGC CCAAGTCCGT GTTCACGATC ATCGATGCGC 100 

15 CCGGCCACCG CGACTTCATC AAGAACATGA TCACGGGCAC GTCGCAGGCG 150 

GACGCCGCCA TCCTGATGAT CGACTCGACG CATGGTGGCT TCGAGGCTGG 2 00 

CATCTCGAAG GACGGCCAGA CCCGCGAGCA CGCGCTGCTT GCCTTCACTC 2 50 

TTGGCGTGAA GCAGATGGTG GTGTGCTGCA ACAAGATGGA CGACAAGACG 3 00 

GTGACGTACG CGCAGTCGCG CTACGATGAG ATCAGCAAGG AGGTGGGCGC 350 

2 0 GTAC CTGAAG CGCGTGGGCT ACAACCCGGA GAAGGTGCGC TTCATCCCGA 4 00 

TCTCGGGCTG GCAGGGCGAC AACATGATCG AGAAGT CGGA CAACATGCCG 45 0 

TGGTACAAGG GTCCCACGCT GCTGGACGCG CTCGACATGC TGGAGCCGCC 500 

GGTGCGCCCG GTGGACAAGC CGCTGCGCCT GCCCCTGCAG GACGTGTACA 5 50 

AGATCGGCGG TATCGGGACG GTGCCCGTGG GCCGCGTGGA GACCGGCATC 600 

25 ATGAAGC CGG GCGACGTGGT GACGTTCGCG CCCGCCAACG TGACGACTGA 650 

GGTGAAGTCG AT CGAGATGC ACCACGAGCA GCTGGCGGAG GCGCAGCCCG 7 00 

GCGACAACGT CGGCTTCAAC GTGAAGAACG TGT CGGTGAA GGACATCCGC 750 

CGTGGTAACG TGTGCGGCAA CTCGAAGAAC GACCCGCCGA AGGAGGCGGC 8 00 

CGACTTCACG GCGCAGGTGA TCGTGCTGAA CCACCCCGGC CAGATCAGCA 8 50 

3 0 ACGGCTATGC GCCGGTGCTG GACTGCCACA CGAGC CACAT TGCGTGCCGC 9 00 

" TTCGCGGAAA TCGAGTCCAA GATCGACCGC CGCTCCGGCA AGGAGCTGGA 950 

GAAGAACCCC AAGGCGATCA AGTCTGGCGA TGCCGCGATC GTGAAGATGG 10 00 

TGC CGCAGAA GCCGATGTGC GTGGAGGTGT TCAACGACTA CGCGCCGCTG 1050 

GGCCGCTTTG CCGTGCGCGA C 1071 

35 



2) INFORMATION FOR SEQ ID NO: 521 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1114 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmeuiia hertigi subsp. hertigi 
50 (B) STRAIN: ATCC 50125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521 

TCGTTCAAGT ACGCGTGGGT GCTCGACAAG CTGAAGGCGG AGCGCGAGCG 50 

55 CGGTATCACG ATCGACATTG CGCTGTGGAA GTTCGAGTCG CCCAAGTCGG 100 

TGTTCACGAT CATCGATGCG CCCGGCCACC GCGACTTCAT CAAGAACATG 150 

AT CAC CGGCA CGTCGCAGGC GGATGCTGCC ATTCTGATGA TCGATTCGAC 2 00 

GCAGGGTGGC TTCGAGGCTG GCATCTCGAA GGACGGCCAG ACGCGCGAGC 250 

ACGCGCTGCT GGCCTTCACG CTGGGCGTGA AGCAGATGGT TGTGTGCTGC 3 00 

60 AACAAGATGG ACGACAAGAC GGTGCAGTAC GCGCAGGCGC GCTAC GAGGA 3 50 
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GATCAGCAAG GAGGTGGGCG CGTACCTGAA 
AGAAGGTGCG CTTCATCCCG ATCTCGGGCT 
GAGAAGTCCG ACAACATGTC GTGGTACAAG 
GCTGGACATG CTGGAGGCGC CGGTGCGCCC 
5 TGCCCCTGCA GGACGTGTAC AAGATCGGCG 
GGCCGTGTGG AGACCGGCAT CATGAAGCCC 
GCCCGCCAAC GTGACGACGG AGGTGAAGTC 
AGCTGCAGGA GGCTGTGCCC GGCGACAACG 
GTGTCGGTGA AGGACATCCG CCGTGGTAAC 

10 CGACCCGCCG AAGGAGGCGG CTGACTTCAC 
ACCACCCCGG CCAGATCAGC AACGGCTACG 
ACCAGCCACA TCGCGTGCCG CTTCGCGGAC 
CCGCTCCGGC AAGGAGCTGG AG AAGAAC C C 
ACGCCGCCAT CGTGAAGATG GTGCCGCAGA 

15 TTCAACGACT ACCCGCCGCT GGGCCGCTTT 
AACCGTTGCC GTCG 



GCGCGTGGGC TACAACCCGG 4 00 

GGCAGGGCGA CAACATGATC 4 50 

GGTCCCACGC TGCTGGAGGC 50 0 

GGTGGACAAG CCGCTGCGCC 55 0 

GCATTGGCAC GGTGCCGGTG 6 00 

GGCGACGTGG TGACGTTCGC 650 

GATCGAGATG CACCACGAGC 70 0 

TCGGCTTCAA CGTGAAGAAC 750 

GTGTGTGGCA ACT CGAAGAA 80 0 

GGCGCAGGTG ATCGTGCTGA 85 0 

CGCCGGTGCT GGACTGCCAC 90 0 

ATCGAGTCGA AGATCGACCG 950 

CAAGTCCATC AAGTCCGGCG 1000 

AGCCGATGTG CGTGGAGGTG 1050 

GCGGTGCGCG ACATGCGCCA 110 0 

1114 



20 2) INFORMATION FOR SEQ ID NO: 522 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1106 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania major 

(B) STRAIN : ATCC 50122 



35 



(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 522 



TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

ACGTCGCAGG CGGACGCCGC CATTCTGATG ATCGACTCGA CGCATGGCGG 200 

40 CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC TCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 300 

GACGACAAGA CGGTGACGTA CGCGCAGTCR CGCTACGATG AGATCAGCAA 350 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400 

GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAAGTCG 45 0 

45 GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGCCGCGTG 600 

GAGACCGGCA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

CGTGACGACT GAGGTGAAGT CGAT CGAGAT GCACCACGAG CAGCTGGCGG 7 00 

50 AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 

AAGGACATCC GCCGTGGTAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850 

GCCAGATCAG CAACGGCTAT GCGCCGGTGC TGGACTGCCA CACGAGCCAC 90 0 

ATTGCGTGCC GCTTCGCGGA AATCGAGTCC AAGATCGACC GCCGCTCCGG 95 0 

55 CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GATGCCGCGA 1000 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTTGC 1100 

CGTCGG 1106 

60 
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2) INFORMATION FOR SEQ ID NO: 523 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 1105 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 ( i i ) MOLECULE TYPE : Genomi c DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania amazonensis 

(B) STRAIN: ATCC 50131 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 523 



TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTC GAGTC GCCGAAGTCC GTGTTCACGA 100 

2 0 TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

ACGTCGCAGG CGGACGCGGC CATCCTGATG ATCGACTCGA CGCATGGTGG 2 00 

CTTCGAGGCT GGCATCTCGA AGGACGGC CA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC TCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 3 00 

GACGACAAGA CGGTGATGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 350 

25 GGAGGTGAGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 4 00 

GCTTCATCCC GATCTCGGGG TGGCAGGGCG ACAACATGAT CGACAAGTCG 4 50 

GACAACATG C CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGCCGCGTG 600 

3 0 GAGACCGGGA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 

AAGGACATCC GCCGTGGTAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 85 0 

35 GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

ATCGCGTGCC GCTTCGCGGA GATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

CAAGGAG CTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GACGC CGCGA 10 00 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTTGC 1100 

40 CGTCG 1105 



2) INFORMATION FOR SEQ ID NO: 524 



45 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Leishmania mexicana 

(B) STRAIN: ATCC 50156 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524 

60 TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 
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GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCGAAGTCC GTGTTCACGA 10 0 

TCATCGATGC GCCCGGCCAC CGCGACTTCA T CAAGAAC AT GATCACGGGC 15 0 

ACGTCGCAGG CGGACGCGGC CATCCTGATG ATCGACTCGA CGCATGGTGG 200 

CTTCGAGGCT GGCATCTCGA AGGAC GGC C A GACCCGCGAG CACGCGCTGC 250 

5 TTGCCTTCAC TCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 30 0 

GACGACAAGA CGGTGATGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 35 0 

GGAGGTGAGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 4 00 

GCTTCATCCC GATCTCGGGG TGGCAGGGCG ACAACATGAT CGACAAGTCG 450 

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

10 GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

AGGACGTGTA CAAGATCGGC GGTAT CGGGA CGGTGCCCGT GGGCCGCGTG 600 

GAGACCGGGA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 

15 AAGGACATCC GCCGTGGTAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 85 0 

GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

ATCGCGTGCC GCTTCGCGGA GATCGAGTCC AAGAT CGACC GCCGCTCCGG 95 0 

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GACGCCGCGA 1000 

2 0 TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 105 0 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTT 109 8 



25 2) INFORMATION FOR SEQ ID NO: 525 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1081 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY : ; Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tarentolae 

(B) STRAIN: II WT 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525 
40 - 

GCTGAAGGCG GAGCGCGAGC GCGGCATCAC GATCGACATT GCGCTGTGGA 50 

AGTTCGAGTC GCCCAAGTCG GTGTTCACGA TCATCGATGC GCCCGGCCAC 100 

CGCGACTTCA TCAAGAACAT GATCACGGGC ACGTCGCAGG CGGACGCGGC 150 

CATCCTGATG ATCGACTCGA CGCACGGTGG GTTCGAGGCT GGCATCTCGA 2 00 

4 5 AGGACGGGCA GACGCGCGAG CACGCGCTGC TTGCCTTCAC TCTTGGCGTG 250 

AAGCAGATGG TTGTGTGCTG CAACAAGATG GACGACAAGA CGGTGATGTA 3 00 

CGCGCAGTCG CGCTACGATG AGATCAGCAA GGAGGTGGGC GCGTACCTGA 350 

AGCGCGTGGG CTACAACCCG GAGAAGGTGC GCTTCATCCC GATCTCGGGC 4 00 

TGGCAGGGCG ACAACATGAT CGAGAAGTCG GACAACATGC CGTGGTACAA 450 

5 0 GGGTCCCACG CTGCTGGACG CGCTCGACAT GCTGGAGGCG CCGGTGCGCC 500 

CGGTGGACAA GCCGCTGCGC CTGCCCCTGC AGGACGTGTA CAAGATCGGC 550 

GGCATCGGCA CGGTGCCCGT GGGCCGCGTG GAGACCGGCA TCATGAAGCC 600 

GGGCGACGTG GTGACGTTCG CGCCCGCGAA CGTGACGACG GAGGTGAAGT 650 

CGATCGAGAT GCACCACGAG CAGCTGGCGG AGGCGCAGCC CGGCGACAAC 700 

55 GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG AAGGACATCC GCCGTGGGAA 750 

CGTGTGCGGT AACTCGAAGA ACGACCCGCC GAAGGAGGCC GCCGACTTCA 800 

CGGCGCAGGT GATCGTGCTG AACCACCCCG GCCAGATCAG CAACGGCTAC 85 0 

GCGCCGGTGC TGGACTGCCA CACGAGCCAC ATCGCGTGCC GGTTCGCGGA 900 

CATCGAGTCC AAGATTGACC GCCGCTCCGG CAAGGAGCTG GAGAAGAACC 95 0 

6 0 CCAAGGCGAT CAAGTCCGGC GATGCCGCGA TCGTGAAGAT GGTGCCGCAG 10 0 0 
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15 



55 



AAGCCGATGT GCGTGGAGAT GTTCAACGAC TACGCGCCGC TTGGCCGCTT 1050 
TGCTGTGCGC GACATGCGCC AAACCGTTGC C 1081 



2) INFORMATION FOR SEQ ID NO: 526 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1102 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 5 0129 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526 

AAATACGCGT GGGTGCTCGA CAAGCTGAAG GCGGAGCGCG AGCGCGGCAT 50 

CACGATCGAC ATTGCGCTGT GGAAGTTCGA GTCGCCCAAG TCCGTGTTCA 10 0 

CGATCATCGA TGCGCCCGGC CACCGCGACT TCATCAAGAA CATGATCACG 150 

25 GGCACGTCGC AGGCGGACGC CGCCATCCTG ATGAT CGACT CGACGCATGG 200 

TGGCTTCGAG GCTGGCATCT CGAAGGACGG CCAGACCCGC GAGCACGCGC 250 

TGCTTGCCTT CACKCTTGGC GTGAAGCAGA TGGTGGTGTG CTGCAACAAG 300 

ATGGACGACA AGACGGTGAC GTACGCGCAG TCGCGCTACG ATGAGATCAG 35 0 

CAAGGAGGTG GGCGCGTACC TGAAGCGCGT GGGCTACAAC CCGGAGAAGG 40 0 

3 0 TGCGCTTCAT CCCGATCTCG GGCTGGCAGG GCGACAACAT GATCGAGAAG 450 

TCGGACAACA TGCCGTGGTA CAAGGGTCCC ACGCTGCTGG ACGCGCTCGA 500 

CATGCTGGAG CCGCCGGTGC GCCCGGTGGA CAAGCCGCTG CGCCTGCCCC 550 

TGCAGGACGT GTACAAGATC GGCGGTATCG GGACGGTGCC CGTGGGGCGC 600 

GTGGAGACCG GCATCATGAA GCCGGGCGAC GTGGTGACGT TCGCGCCCGC 6-50 

3 5 CAACGTGACG ACTGAGGTGA AGTCGATCGA GATGCACCAC GAGCAGCTGG 700 

CGGAGGCGCA GCCCGGCGAC AACGTCGGCT TCAACGTGAA GAACGTGTCG 750 

GTGAAGGACA TCCGCCGTGG TAACGTGTGC GGCAACTCGA AGAACGACCC 800 

GCCGAAGGAG GCGGCCGACT TCACGGCGCA GGTGATCGTG CTGAACCACC 850 

CCGGCCAGAT CAGCAACGGC TACGCGCCGG TGCTGGACTG CCACACGAGC 90 0 

4 0 CACATTGCGT GCCGCTTCGC GGAAATC GAG TCCAAGATCG ACCGCCGCTC 95 0 

CGGCAAGGAG C TGGAGAAGA ACCCCAAGGC GATCAAGTCT GGCGATGCCG 100 0 

CGATCGTGAA GATGGTGCCG CAGAAGC CGA TGTGCGTGGA GGTGTTCAAC 1050 

GACTACGCGC CGCTGGGCCG CTTTGCCGTG CGCGACATGC GCCAAACCGT 110 0 

TG 1102 

45 

2) INFORMATION FOR SEQ ID NO: 527 

50 ( i ) SEQUENCE CHARACTE R I STI CS : 

(A) LENGTH: 1105 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neospora caninum 
6 0 (B) STRAIN: Suarez-4 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 527 

GGACRAACTT AAAGCTGAAC GTGAGCGTGG TAT C AC CATT GATATCTCCC 5 0 

5 TGTGGAAATT TGAGAC CAGC AAGTACTATG TTACCATCAT TGATGCCCCA 10 0 

GGACACAGAG ACTTCATCAA AAACATGATT ACAGGCACAT CCCAGGCTGA 150 

CTGTGCTGTC CTGATTGTTG CTGCTGGTGT TGGTGAATTT GAAGC CGGTA 2 00 

TCTCCAAGAA CGGGCAGACC CGTGAGCATG CCCTTNTGGC TTACACCCTG 250 

GGTGTGAAAC AACTAATTGT TGGCGTTAAC AAAAKGGATT CCACTGAGCC 300 

10 ACCCTATAGC CARAAGAGAT ACGARGAAAT TGTTAAGGAA GTCAGCMCCT 350 

AYNTTAAAAA AATTGGYTAC AACCCCGACA CAGTANCATT TGKGCCAATT 4 00 

TNTGGCTGGA ATGGTGACAA CATGCTGGAN CCAAGTGCTA ATATGCCATG 450 

GTTCAAGGGA TGGAAAGT CM CCCGTAAGGA CGGCAATGCC AGKGGAACCM 500 

CCCTGCTTGA AGCTYTGGAT TGCATTYTGC CACCAAYTTG CCCAACTGAC 550 

15 AAACCCTTGC GTTTGCCTYT CCAGGATGTC TATAAAATTG GKGGTATTGG 600 

TACTGTCCCT GTGGGTCGTG TGGAGACTGG TGTTCTCAAA CCTGGCATGG 650 

TGGTCACCTT TGCTCCAGTC AATGTAACAA CTGAAGTGAA GTCTGTAGAA 700 

ATGCACCATG AAGCATTGAG TGAAGCCCTT CCTGGGGACA ATGTGGGCTT 75 0 

CAATGTCAAG AACGTGTCTG TCAAAGATGT CCGTCGTGGC AATGTGGCTG 8 00 

2 0 GTGACAGCAA AAATGATC CA CCCATGGAAG CTGCTGGCTT CACAGC TCAG 850 

GTGATTATTT TGAACCATCC AGG CCAAAT C AGTGCTGGAT ATGCACCTGT 900 

GCTGGATTGT CACACAGCTC ACATTGCTTG CAAGTTTGCT GAGCTGAAGG 950 

AGAAGATTGA TCGTCGTTCT GGGAAAAAGC TGGAAGATGG CCCTAAATTC 10 00 

TTGAAATCTG GTGACGCTGC CATCGTTGAT ATGGTTCCTG GCAAGCCCAT 1050 

25 GTGTGTCGAG AGCTTCTCTG ATTATCCTCC CCTGGGCCGT TTTGCTGTGC 1100 

GTGAC 1105 



30 2) INFORMATION FOR SEQ ID NO: 528 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 935 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichomonas vaginalis 

(B) STRAIN: ATCC 30001 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 528 



GCACATCCCA GGCTGATGCT GCTATCCTTG TCATCGACTC CACACTCGGT 50 

GGYTTCGAAG CCGGTATCGC TGAACAAGGC CAGACACGTG AACACGCTCT 100 

TCTTGCCTTC ACACTCGGCA TCAAGCAGGT CATTGTCGCC GTCAACAAGA 150 

TGGATGACAA GACAGTCAAC TACAACAAGG CYCGTTTCGA CGAAATCACA 2 00 

50 GCCGAAATGA CACGCATCCT TACAGGC AT C GGCTACAAGC CAGAAATGTT 2 50 

CCGCTTCGTC CCAATCTCCG GCTGGGCTGG CGACAACATG ACAGAGAAGT 3 00 

CTCCAAACAT GCCATGGTAC AATGGCCCAT ACCTTCTTGA AGCCCTCGAT 350 

TCCCTTCAGC CACCAAAGCG CCCATTCGAC AAGCCACTCC GTCTTCCACT 4 00 

CCAGGATGTC TACAAGATCA ACGGTATCGG TACAGTTC CA GTCGGCCGTG 4 50 

55 TCGAATCCGG CACAATGAAG CCAGGCATGA TCGTTAACTT CGCCCCATCC 5 00 

ACAGTTACAG CTGAAGTTAA GTCCATCGAA ATGCACCACG AATCCCTTCC 550 

AGAGGCTCTT CCAGGTGACA ACATCGGCTT CAACGTCAAG AACGTTTCCA 600 

CAGC TGATGT CAAGCGTGGC TACGTCGTTG GTGATACAAA GCGTGACCCA 650 

CCAGTCGAAT GCGCTTCCTT CACAGCTCAR ATGATCATCT CCAACCACCC 700 

60 AGGCAAGATC CACGCCGGCT AC CAGCCAGT TTTCGACTGC CACACAGCTC 750 
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ACATCGCCTG CAAGTTCGAC 
GGCAAGAAGG CTACAGAGAA 
TATCGTCGAG GTTGTCCCAT 
AGTACCCACC ACTCGGCCGT 



AAGCTCATCC AGCGTATCGA 
CCCAGAATAC ATTCAGAAGG 
CCAAGCCACT CGTCGTCGAG 
TTCGCCATCC GTGAT 



TCGTCGCCAC 800 
ATGATGCCGC 850 
TCCTTCCAGG 900 
935 



2) INFORMATION FOR SEQ ID NO: 52 9 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1065 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei subsp. brucei 
20 (B) STRAIN: EATR0795 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 529 

AAGCTGAAGG CTGAGCGCGA ACGTGGTATC ACGAT CGACA TTGCACTGTG 50 

2 5 GAAATT CGAG TCACCCAAGT CTGTCTTCAC TATTATTGAT GCTCCTGGGC 100 

ACCGTGACTT CATCAAGAAC ATGATCACCG GCACATCGCA AGCCGACGCA 150 

GCCATCCTCA TCATTGCCTC TGCGCAGGGT GAGTTCGAGG CTGGTATCTC 200 

CAAGGATGGA CAGACCCGCG AGCACGCGTT GCTGGCCTTC ACTTTGGGTG 250 

TGAAGCAGAT GGTTGTGTGC TGCAACAAGA TGGACGACAA GACTGTGAAC 300 

3 0 TACGGACAGG AGCGGTATGA CGAGATTGTG AAGGAGGTGT CTGCTTACAT 350 

CAAGAAGGTT GGGTACAACG TGGAGAAGGT GCGCTTCGTC CCCATCTCCG 4 00 

GATGGCAGGG CGACAACATG ATTGAGAAAT CCGAGAAGAT GCCATGGTAC 450 

AAGGGTC CAA CGCTCCTGGA GGCACTAGAC ATGCTGGAGC CACCAGTGCG 50 0 

TCCGAGCGAC AAGCCCCTGC GTCTGCCACT GCAGGACGTG TACAAGATCG 550 

3 5 GTGGTATTGG CACCGTGCCC GTTGGTCGTG TGGAGACCGG CGTGATGAAG 600 

CCTGGTGATG TGGTGACGTT TGCCCCCGCC AACGTGACGA CCGAGGTGAA 650 

ATCGATCGAG ATGCACCACG AGCAGCTCGC TGAGGCGACC CCCGGTGACA 700 

ACGTCGGCTT TAACGTGAAG AACGTTTCTG TAAAGGACAT CCGCCGTGGC 750 

AACGTCTGCG GTAACACCAA GAACGACCCC CCAAAGGAGG CCGCCGACTT 800 

4 0 CACGGCACAG GTGATCATCC TGAACCACCC CGGACAGATT GGAAACGGTT 850 

ATGCGCCCGT GCTGGACTGC CACACATCGC ACATTGCCTG CAAGTTCGCG 900 

GAGATCGAGT CGAAGATCGA CCGTCGCTCT GGCAAGGAGC TGGAGAAGGC 950 

TCCCAAGTCG ATCAAGTCTG GCGACGCCGC GATCGTGCGC ATGGTGCCGC 100 0 

AGAAGCCTAT GTGCGTGGAG GTCTTCAACG ACTACGCGCC ACTCGGCCGC 1050 

4 5 TTTGCCGTGC GTGAC 1065 



50 



2) INFORMATION FOR SEQ ID NO : 530 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 97 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Crithidia fasciculata 
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(B) STRAIN: ATCC 11745 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 530 

c TTCGCGGAGG GCGTGCCGCC GGTGCTGACG GCGCTGGACG TGACGGAGGA 50 

CCTCGGCCGC GACGAGCCGC TGACGCTGGA GATTGTGCAG CACTTGGACG 100 
CGAACACCGG CCGCTGCATT GCCATGCAGA CGACGGATCT GCTGAAGCTG 150 

AAGTCGAAGG TTGTGTCGAC GGGCGGCAAC ATCTCCGTGC CGGTTGGCCG 200 
CGAGACGCTG GGCCGCATCT TCAACGTGCT CGGCGACGCG ATCGACCAGC 2 50 

10 GCGGTGTGGT GGGCGAGAAG ATGCGCATGC CGATCCACGC CGAGGCGCCG 3 00 

AAGCTGGCGG ACCAGGCCGC GGAGGACGCG ATTCTGACGA CCGGCATCAA 3 50 

GGTGATCGAC CTGATTCTGC CGTACTGCAA GGGTGGCAAG ATCGGG CTGT 400 
TCGGCGGTGC TGGTGTGGGC AAGACTGTGA TCATCATGGA GCTGATCAAC 45 0 

AACGTGGCCA AGGGCCACGG TGGTTTCTCC GTGTTCGCCG GCGTTGG CGA 500 

15 GCGCACCCGC GAGGGCACGG ATCTGTACCT GGAGATGATG CAGTCGAAGG 550 
TCATTGACCT GAAGGGCGAG TCGAAGTGCG TGCTGGTGTA CGGCCAGATG 600 
AACGAGCCCC CGGGTGCGCG TGCGCGTGTT GCGCAGTCTG CGCTGACGAT 6 50 

GGCGGAGTAC TTC CGTGACG TGGAGGGCCA GAACGTGCTG CTGTTCATCG 700 
ACAACATCTT CCGCTTCACC CAGGCCAACT CCGAGGTGTC CGCCCTGCTG 7 50 

2 0 GGCCGCATTC CCGCCGCCGT GGGCTACCAG CCGACGCTTG CCGAGGATCT 8 00 

TGGTATGCTG CAGGAGCGCA TTACGTCGAC GACGAAGGGC TCGATTACGT 8 50 

CTGTGCAGGC CGTGTACGTG CCGGCCGATG ATATCACGGA TCCGGCGCCG 90 0 

GCGACGACCT TCTCGCACCT GGATGCGACG ACGGTGCTGG ACCGCGCGGT 950 

TGCCGAGTCT GGCATCTACC CCGCCGTGAA CCCGCTGGAG TGCGCGTCGC 1000 

25 GTATCATGGA CCCCGATGTG ATCGACGTGG ACCACTACAA CGTTGCGCAG 1050 

GATATCGTGC AGATG CTGAC CAAGTACAAG GAGCTGCAGG ATATCATTGC 1100 

CGTGCTGGGT AT CGACGAGC TGAGCGAGGA GGACAAGCTT GTGGTGGACC 1150 

GCGCTCGCAA GGTGACGCGC TTCCTGTCGC AGCCGTTCCA GGTGGC CGAG 12 00 

GTGTTCACCG GCATGACGGG CCACTACGTG CAGCTGGAGG ACACAGTGGA 12 5 0 

3 0 GTCGTTCTCT GGCCTGCTGA TGGGCTCGTA CGACCAGATC CCGGAGA 12 97 



2) INFORMATION FOR SEQ ID NO: 531 



35 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1298 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

4 0 (D) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE : Genomi c DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Leishmania tropica. 

(B) STRAIN: ATCC 30816 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531 

5 0 CTTCTCGGAG GGCGTGCCGC CCGTGCTGAC GGCGCTGGAT GTGACGGAGG 50 

ACCTTGGCCG CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAC 100 

GCGAACACGG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

GAAGTCGAAG GTCGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 200 

GTGAGACGCT GGGCCGCATC TTCAAYGTTC TGGGCGACGC GATCGACCAG 2 50 

5 5 CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 3 00 

GAAGCTGGCG GATCAGGC CG CGGAGGACAC GATCCTGACG ACCGGCATCA 350 

AGGTGATCGA CCTGATTCTG CCCTACTGCA AGGGTGGCAA GATCGGCCTG 4 00 

TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 450 

CAACGTCGCG AAGGGCCACG GCGGTTTCTC CGTGTTTGCC GGCGTTGGCG 500 

6 0 AGCGCACGCG CGAGGGCACG GACCTGTACC TGGAGATGAT GCAGTCGAAG 550 
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GTGATTGACC TGAAGGGCGA GTCGAAGTGY 
GAACGAGCCC CCGGGTGCGC GCGCGCGCGT 
TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC 
GACAACATCT TCCGCTTCAC GCAGGCGAAC 
5 GGGCCGCATT CCGGCCGCCG TGGGCTACCA 
TTGGTATGCT GCAGGAGCGC ATCACGTCGA 
TCCGTGCAGG CCGTGTACGT GCCAGCGGAT 
CGCGACGACG TTCTCGCACC TGGACGCGAC 
TGGCGGAGTC GGGCATCTAC CCTGCCGTGA 

10 CGTATCATGG ACCCTGATGT GATCGATGTG 
GGATATCGTG CAGATGCTGA CCAAGTACAA 
CGGTGCTTGG CATCGACGAG CTGAGCGAGG 
CGCGCGCGCA AGGTGACCCG GTTCCTGTCG 
GGTGTTCACG GGCATGACGG GCCACTACGT 

15 AGTCGTTCTC TGGCCTGCTG ATGGGGTCGT 



GTGCTTGTGT ATGGGCAGAT 600 

TGCGCAGTCT GCGCTGACGA 650 

AGAACGTGCT GCTGTTCATC 700 

TCCGAGGTGT CTGCGCTGCT 7 50 

GCCGACGCTT GCGGAGGATC 8 00 

CAACGAAGGG GTCGATCACG 850 

GATATCACGG ATCCCGCGCC 900 

GACTGTGCTG GACCGCGCGG 950 

ACCCGCTGGA GTGCGCGTCG 1000 

GAC C ACTACA ACGTTGCGCA 1050 

GGAG CTGCAG GATATCATTG 1100 

AAGACAAGGT TGTTGTGGAC 1150 

CAGCCGTTCC AGGTTGCGGA 12 00 

GCAGCTGGTC GACACGGTGG 1250 

ACGACCAGAT CCCGGAGA 12 98 



2) INFORMATION FOR SEQ ID NO: 532 



20 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1297 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Lei shmanla aethiopica 

(B) STRAIN: ATCC 50119 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 532 

3 5 TTCTCGGAGG GCGTGCCGCC CGTGCTGACG GCGCTGGATG TGACGGAGGA 50 

CCTTGGCCGC GATGAGCCGC TGACGCTGGA GATCGTGCAG CACTTGGACG 100 

CGAACACCGG CCGCTGCATT GCGATGCAGA CGACGGACCT GCTGAAGCTG 150 

- AAGT CGAAGG TTGTGTCGAC CGGCGGCAAC ATCTCTGTGC CGGTGGGCCG 2 00 

TGAGACGCTG GGC CGCATCT TCAACGTTCT GGGCGACGCG ATCGACCAGC 250 

4 0 GCGGCCCCGT GGGCGAGAAG ATGCGCATGG CGATCCACGC CGAGGCCCCA 3 00 

AAGCTGGCGG ATCAGGCCGC GGAGGACACG ATCCTGACGA CCGGCATCAA 350 

GGTGATCGAC CTGATTCTGC CCTACTGCAA GGGTGGCAAG ATCGGCCTGT 4 00 

TCGGCGGTGC CGGTGTGGGC AAGACTGTGA TCATCATGGA GCTGATCAAC 4 50 

AACGTCGCGA AGGGCCACGG TGGTTTCTCC GTGTTTGCCG GCGTTGGCGA 500 

45 GCGCACGCGC GAGGGCACGG ACCTGTACCT GGAGATGATG CAGTCGAAGG 550 

TGATTGACCT GAAGGG CGAG TCGAAGTGCG TGCTTGTGTA CGGGCAGATG 600 

AACGAGCCCC CGGGTGCGCG CGCGCGCGTT GCGCAGTCTG CGCTGACGAT 650 

GGCGGAGTAC TTCCGCGACG TGGAGGGC CA GAACGTGCTG CTGTTCATCG 7 00 

ACAACATCTT CCGCTTCACG CAGGCGAACT CCGAGGTGTC TGCGCTGCTG 750 

50 GGCCGCATTC CAGCCGCCGT GGG CTACCAG CCGACGCTTG CGGAGGATCT 800 

TGGTATGCTG CAGGAGCGCA TCACGTCGAC AACGAAGGGG TCGATCACGT 850 

CCGTGCAGGC CGTGTACGTG CCAGCGGATG ATATCACGGA TCCCGCGCCC 900 

GCGACGACGT TCTCGCACCT GGACGCGACG ACTGTGCTGG ACCGCGCGGT 950 

GGCGGAGTCG GGCATCTACC CTGCCGTGAA CCCGCTGGAG TGCGCGTCGC 1000 

55 GTATCATGGA CCCCGACGTG ATCGATGTGG AC C ACTACAA CGTTGCGCAG 1050 

GATATCGTGC AGATGCTGAC CAAGTACAAG GAGCTGCAGG ATATCATTGC 1100 

GGTGCTTGGC ATCGACGAGC TGAGCGAGGA AGACAAGGTT GTTGTGGACC 1150 

GCGCGCGC7KA GGTGACCCGG TTCCTGTCGC AGCCGTTCCA GGTTGCGGAG 12 00 

GTGTTCACGG GCATGACGGG CCACTACGTG CAGCTGGTCG ACACGGTGGA 12 50 

60 GTCGTTCTCT GGCCTGCTGA TGGGGTCGTA CGACCAGATC CCGGAGA 12 97 



460 



CA 02283458 1999-09-28 



2) INFORMATION FOR SEQ ID NO: 533 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1298 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: _ 
15 ( A ) ORGANISM: Leishmania donovani subsp. donovani 

(B) STRAIN: ATCC 50212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 533 

90 CTTCTCGGAG GGCGTGCCGC CCGTACTGAC GGCGCTGGAT GTGACGGAGG 50 

ACCTTGGCCG CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAT 100 

GCGAACACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

G^GTCGAAG GTTGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 200 

GTGAGACGCT GGGCCGCATC TTCAACGTTC TGGGCGACGC GATCGACCAG 250 

2 5 CGCGGCCCCG TGGGCGAGAA GATGCG CATG GCGATCCACG CCGAGGCCCC 300 

GAAGCTGGCG GACCAGGCCG CGGAGGACAC GATCCTGACG ACCGGCATCA 350 

AGGTGATCGA CCTCATTCTG CCCTACTGCA AGGGCGGCAA GATCGGCCTG 400 

TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 450 

C AACGT CGCG AAGGGCCACG GTGGCTTCTC CGTGTTTGCC GGCGTTGGCG 500 

3 0 AGCGCACGCG CGAGGGCACG GACCTATACC TGGAGATGAT GCAGTCGAAG 550 

GTGATTGACC TGAAGGGCGA GTCGAAGTGC GTGCTTGTGT ACGGGCAGAT 60 0 

GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 700 

GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT 750 

35 GGGCCGCATT CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCCGAGGATC 800 

TTGGTATGCT GCAGGAGCGC ATCACATCGA CGACGAAGGG GTCGATCACG 850 

TCCGTGCAGG CCGTGTACGT GCCGGCGGAT GATAT CACGG ATCCCGCGCC 90 0 

CGCGACGACG TTCTCGCACC TGGACGC GAC GACTGTGCTG GACCGCGCGG 950 

TGGCGGAGTC GGGCATCTAC CCTGCCGTGA AC CCGCTGGA GTGCGCGTCG 10 00 

4 0 CGTATCATGG ACCCCGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 1050 

GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATAT CATTG 1100 

CGGTGCTTGG CATCGACGAG CTGAGCGAGG AGGACAAGGT TGTGGTGGAC 1150 

CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 12 0 0 

GGTGTTCACG GGCATGACGG GCCACTACGT GCAGCTGGCC GACACGGTGG 1250 

4 5 AGTCGTTCTC TGGCCTGCTG ATGGGGTCGT ACGACCAGAT CCCGGAGA 1298 



2) INFORMATION FOR SEQ ID NO: 534 

50 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1298 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Leishmania donovani subsp. infantum 
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(B) 



STRAIN: MOU 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 534 



5 CTTCTCGGAG 
ACCTTGGCCG 
GCGAACACCG 
GAAGTCGAAG 
GTGAGACGCT 

10 CGCGGCCCCG 
AAAGCTGGCG 
AGGTGAT CGA 
TTCGGCGGTG 
CAACGTCGCG 

15 AGC GCACGCG 
GTGATTGACC 
GAACGAGCCC 
TGGC GGAGTA 
GACAACATCT 

2 0 GGGCCGCATT 

TTGGTATGCT 
TCCGTGCAGG 
CGCGACGACG 
TGGCGGAGTC 
25 CGTATCATGG 
GGATATCGTG 
CGGTGCTTGG 
CGCGCGCGCA 
GGTGTTCACG 

3 0 AGTCGTTCTC 



GGCGTGCCGC 

CGATGAGCCG 

GCCGCTGCAT 

GTTGTGTCGA 

GGGCCGCATC 

TGGG CGAGAA 

GACCAGGCCG 

CCTCATTCTG 

CCGGTGTGGG 

AAGGGCCACG 

CGAGGGCACG 

TGAAGGGCGA 

CCGGGTGCGC 

CTTCCGCGAC 

TCCGCTTCAC 

CCGGCCGCCG 

GCAGGAGCGC 

CCGTGTACGT 

TTCTCGCACC 

GGGCAT CTAC 

ACCCCGATGT 

CAGATGCTGA 

CATCGACGAG 

AGGTGACCCG 

GGCATGACGG 

TGGCCTGCTG 



CCGTACTGAC 

CTGACGCTGG 

TGCGATGCAG 

CCGGCGGCAA 

TTCAACGTTC 

GATGCGCATG 

CGGAGGACAC 

CCCTACTGCA 

CAAGACTGTG 

GTGGCTTCTC 

GACCTATACC 

GTCGAAGTGC 

GCGCGCGCGT 

GTGGAGGGCC 

G CAGGCGAAC 

TGGGCTACCA 

ATCACATCGA 

GCCGGCGGAT 

TGGACGCGAC 

CCTGCCGTGA 

GATCGATGTG 

CCAAGTACAA 

CTGAGCGAGG 

GTTCCTGTCG 

GCCACTACGT 

ATGGGGT CGT 



GGCGCTGGAT 

AGATCGTGCA 

ACGACGGACC 

CATCTCTGTG 

TGGGCGACGC 

GCGATCCACG 

GATCCTGACG 

AGGG CGGCAA 

ATCATCATGG 

CGTGTTTGCC 

TGGAGATGAT 

GTGCTTGTGT 

TGCGCAGTCT 

AGAACGTGCT 

TCCGAGGTGT 

GCCGACGCTT 

CGACGAAGGG 

GATATCACTG 

GACTGTGCTG 

ACCCGCTGGA 

GACCACTACA 

GGAGCTGCAG 

AGGACAAGGT 

CAGCCGTTCC 

GCAGCTGGCC 

ACGAC CAGAT 



GTGACGGAGG 5 0 

GCACTTGGAT 100 

TGCTGAAGCT 150 

CCGGTGGGCC 2 00 

GATCGACCAG 2 50 

CCGAGGCCCC 3 00 

ACCGGCATCA 3 50 

GATCGGCCTG 400 

AGC TGATCAA 450 

GGCGTTGGCG 500 

GCAGTCGAAG 550 

ACGGGCAGAT 600 

GCGCTGACGA 650 

GCTGTTCATC 70 0 

CTGCGCTGCT 750 

GCCGAGGATC 800 

GTCGATCACG 850 

ATCCCGCGCC 90 0 

GACCGCGCGG 950 

GTGCGCGTCG 1000 

ACGTTGCGCA 105 0 

GATATCATTG 110 0 

TGTGGTGGAC 1150 

AGGTTGCGGA 12 0 0 

GACACGGTGG 12 50 

CCCGGAGA 12 98 



2) INFORMATION FOR SEQ ID NO: 535 



35 



( i ) SEQUENCE CHARACTER I STI CS : 

(A) LENGTH: 13 01 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Leishmania gerbilli 

(B) STRAIN: ATCC 50121 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 535 

50 GCACTTCTCG GAGGGCGTGC CGCCCGTGCT GACGGCGCTG GATGTGACGG 50 

AGGACCTTGG CCGCGATGAG CCGCTGACGC TGGAGATCGT GCAGCACTTG 10 0 

GAC GCGAACA CCGGCCGCTG CATTGCGATG CAGACGACGG ACCTGCTGAA 150 

GCTGAAGTCG AAGGTTGTGT CGACCGGTGG CAACATCTCT GTGCCGGTGG 200 

GCCGTGAGAC GCTGGGCCGC ATCTTCAACG TTCTGGGCGA TGCGAT CGAC 250 

55 CAGCGCGGCC CCGTGGGCGA GAAGATGCGC ATGGCGATCC ACGCCGAGGC 3 00 

CCCGAAGCTG GCGGATCAGG CCGCGGAGGA CACGATCCTG ACGACCGGCA 3 50 

TCAAGGTGAT CGACCTGATT CTGCCCTACT GCAAGGGTGG CAAGATCGGY 4 00 

CTGTTCGGCG GTGCCGGTGT GGGCAAGACT GTGATCATCA TGGAGCTGAT 450 

CAACAACGTC GCGAAGGGCC ACGGTGGTTT CTCCGTGTTT GCCGGCGTTG 500 

6 0 GCGAGCGCAC GCGCGAGGGC ACGGAC CTGT ACCTGGAGAT GATG CAGTCG 550 
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AAGGTGATTG ACCTGAAGGG CGAGTCGAAG TGCGTGCTTG TGTACGGGCA 600 

GATGAACGAG CCCCCGGGTG CGCGCGCGCG CGTTGCGCAG TCTGCGCTGA 650 

CGATGGCGGA GTACTTCCGC GACGTGGAGG GCCAGAACGT GCTGCTGTTC 700 

ATCGACAACA TCTTCCGCTT CACGCAGGCG AACTCCGAGG TGTCCGCGCT 75 0 

5 GCTGGGCCGC ATTCCGGCCG CCGTGGGCTA CCAGCCGACG CTTGCGGAGG 800 

ATCTTGGTAT GCTGCAGGAG CGCATCACGT CGACAACGAA GGGGTCGATC 850 

ACGTCCGTGC AGGCCGTGTA CGTGCCAGCG GATGATAT CA CGGATCCCGC 900 

GCCCGCGACG ACGTTCTCGC ACCTTGACGC GACGACTGTG CTGGACCGCG 950 

CGGTGGCGGA GTCGGGCATC TACCCTGCCG TGAACCCGCT GGAGTGCGCG 1000 

10 TCGCGTATCA TGGACCCCGA TGTGATCGAT GTGGACCACT ACAACGTTGC 1050 

GCAGGATATC GTGCAGATGC TGACCAAGTA CAAGGAGCTG CAGGACATCA 1100 

TTGCGGTGCT TGGCATCGAC GAGCTGAGCG AGGAAGACAA GGTTGTGGTG 1150 

GACCGCGCGC GCAAGGTGAC CCGGTTCCTG TCGCAGCCGT TCCAGGTTGC 1200 

GGAGGTGTTC ACGGGCATGA CGGGCCACTA CGTGCAGCTG GTCGACACGG 1250 

15 TGGAGTCGTT CTCTGGCTTG CTGATGGGGT CGTACGACCA GATCCCGGAG 13 00 

A 1301 



20 ' 2) INFORMATION FOR SEQ ID NO: 536 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 1298 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE : Genomi c DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania hertlgi subsp. hertlgi 

(B) STRAIN: ATCC 50125 



35 



(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 53 6 



CTTCGCGGAG GGCGTGCCGC CGGTGCTGAC GTCGCTGGAT GTGACGGAGA 5 0 

ACCTCGGCCG CGATGAGCCG CTGACGCTGG AGATTGTGCA GCACTTGGAC 10 0 

GCGAACACCG GTCGCTGGAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

GAAGTCGAAG GTCGTGTCGA CCGGTGGCAA CATCTCTGTG CCTGTTGGCC 2 00 

40 GCGAGACGCT GGGTCGCATC TTCAACGTGC TTGGCGATGC GATTGACCAG 25 0 

CGCGGCCCTG TGGGTGAGAA GATGCGCATG GCGATCCACG CCGAGGCGCC 300 

GAAGCTGGCG GATCAGGCGG CAGAGGACAC GATCCTGACG AC CGGCATCA 3 50 

AGGTGATCGA TCTTATTCTG CCGTACTGCA AGGGTGGTAA GATCGGTCTG 4 00 

TTCGGTGGTG CCGGTGTAGG CAAGACTGTG ATTATTATGG AGCTGATCAA 4 50 

45 TAACGTGGCG AAGGGCCACG GTGGGTTTTC CGTGTTTGCT GGCGTGGGCG 50 0 

AGCGCACGCG CGAGGGCACT GACCTGTACC TGGAGATGAT GCAGTCGAAG 550 

GTGATTGACC TGAAGGGCGA ATCAAAGTGC GTGCTTGTGT ACGGACAGAT 60 0 

GAACGAGCCC C CGGGTGC GC GTGCGCGCGT TGCGCAGTCT GCGCTGACGA 65 0 

TGGCCGAGTA CTTCCGCGAT GTGGAGGGCC AGAACGTGCT GCTGTTCATT 700 

50 GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT 750 
GGGTCGCATT CCTGCCGCCG TGGGCTACCA GCCGACGCTT GCGGAGGATC 8 00 

TGGGCATGCT GCAGGAGCGC ATTACGTCGA CGACGAAGGG CTCGATTACG 8 50 

TCTGTGCAGG CCGTGTACGT GC CTGCGGAT GAT AT C AC GG ACCCGGCGCC 90 0 

CGCGACGACG TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGTGCGG 950 

55 TGGCAGAGTC GGGCATTTAC CCTGCGGTGA ACCCGCTGGA GTGCGCGTCG 1000 

CGTATCATGG ACCCCGATGT GATCGATGTG GAC CACTACA ACGTTGCGCA 1050 

GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 110 0 

CCGTGCTTGG TATCGACGAG CTGAG CGAGG AGGACAAGGT TGTGGTGGAC 1150 

CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGC GGA 12 0 0 

60 GGTGTTCACT GGCATGACGG GTCACTACGT TCAGCTGGAG GACACGGTGG 125 0 



463 



CA 02283458 1999-09-28 



AGTCGTTCTC TGGCCTACTG ATGGGGTCAT ACGACCAGAT CCCGGAGA 12 98 



5 2) INFORMATION FOR SEQ ID NO: 537 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1297 bases 

(B) TYPE: Nucleic acid 
10 ( C ) STRANDEDNES S : Doubl e 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmemia major 

(B) STRAIN: ATCC 50122 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 537 



20 



CTTCTCGGAG GGCGTGCCGC CCGTGCTGAC GGCGCTGGAT GTGACGGAGG 50 

ACCTTGGCCG TGATGAGCCG CTGACGCTGG AGAT CGTGCA GCACTTGGAC 100 

GCGAACACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

GAAGT CGAAG GTTGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 200 

25 GTGAGACGCT GGGCCGCATC TTCAACGTTC TGGGCGATGC GAT CGAC GAG 250 

CGCGGCCCCG TGGGC GAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 3 00 

GAAGCTGGCG GATCAGGCCG CAGAGGACAC GATCCTGACG ACCGGCATCA 350 

AGGTGATCGA CTTGATCCTG CCCTACTGCA AGGGTGGCAA GATCGGCCTG 400 

TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 4 50 

3 0 CAATGTCGCG AAGGGCCACG GTGGTTTCTC CGTGTTTGCC GGCGTTGGCG 500 

AGCGCACGCG CGAGGGCACG GACCTGTACC TGGAGATGAT GCAGTCGAAG 5 50 

GTGATTGACC TGAAGGGCGA GTCGAAGTGC GTGCTTGTGT AC GGGC AGAT 6 00 

GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA €50 

TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 7 00 

3 5 GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CCGCGCTGCT 7 50 

GGGCCGCATT CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCGGAGGATC 8 00 

TTGGTATGCT GCAGGAGCGC ATCACGTCGA CAACGAAGGG GTCGATCACG 8 50 

TCCGTGCAGG CCGTGTACGT GCCAGCGGAT GATATCACGG ATCCCGCGCC 900 

CGCGACGACG TTCTCGCACC TGGATGCGAC GACTGTGCTG GACCGCGCGG 950 

4 0 TGGCGGAGTC GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG 1000 

CGTATCATGG ACCCCGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 1050 

GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GACAT CATTG 1100 

CGGTGCTTGG CAT CGACGAG CTGAGCGAGG AAGACAAGGT TGTGGTGGAC 1150 

CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 12 00 

4 5 GGTGTTCACG GGCATGACGG GCCACTACGT GCAGCTGGTC GACACGGTGG 1250 

AGTCGTTCTC TGGC CTGCTG ATGGGGTCGT ACGACCAGAT CCCGGAG 12 97 



50 2) INFORMATION FOR SEQ ID NO: 538 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1297 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Leishmania amazonensis 

(B) STRAIN: ATCC 50131 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 538 

5 

TTCTCGGAGG GCGTGCCGCC CGTGCTGACG GCGCTGGATG TGACGGAGGA 
CCTTGGCCGC GATGAGCCGC TGACGCTGGA GATCGTGCAG CACCTGGACG 
CGAACACCGG CCGCTGCATT GCGATGCAGA CGACGGACCT GTTGAAGCTG 
AAGTCGAAGG TTGTGTCGAC CGG CGGCAAC ATCTCTGTGC CGGTGGGCCG 

10 TGAGACGCTG GGCCGCATCT TCAACGTGCT GGGCGACGCG ATCGACCAGC 
GCGGCCCCGT GGGTGAGAAG ATGCGCATGG CGATCCACGC CGAGGCCCCG 
AAGCTGGCGG ATCAGGCCGC GGAGGACACG ATCCTGACGA CCGGCATCAA 
GGTGATCGAC CTGATTCTGC CCTACTGCAA GGGTGGCAAG ATCGGCCTGT 
TTGGTGGCGC CGGTGTGGGC AAGACCGTGA TCATCATGGA GTTGATTAAC 

15 AACGTCGCGA AGGGCCACGG TGGTTTCTCG GTGTTTGCCG GCGTTGGCGA 
GCGCACGCGC GAGGGCACGG ACCTGTACCT GGAGATGATG CAGTCGAAGG 
TGATTGACCT GAAGGGCGAG TCGAAGTGCG TGCTTGTGTA CGGGCAGATG 
AACGAGCCCC CGGGTGCGCG CGCGCGCGTT GCGCAGTCTG CGCTGACGAT 
GGCGGAGTAC TTCCGAGACG TGGAGGGCCA GAATGTGCTG CTGTTCATCG 

20 ACAACATCTT CCGCTTCACG CAGGCGAACT CCGAGGTGTC TGCGCTGCTG 
GGCCGCATTC CGGCCGCCGT GGGCTACCAG CCGACGCTTG CGGAGGATCT 
TGGTATGCTG CAGGAGCGCA TCACGTCGAC GACGAAGGGG TCGATCACGT 
CCGTGCAGGC CGTGTACGTG CCTGCGGATG ATATCACGGA TCCGGCGCCC 
GCGACGACGT TCTCGCACCT GGACGCGACG ACTGTGCTGG ACCGCGCGGT 

25 GGCGGAGTCG GGGATCTACC CTGCCGTGAA CCCGCTGGAG TGCGCGTCGC 
GTATCATGGA CCCCGATGTG ATCGACGTGG AC C ACTACAA CGTTGCGCAG 
GATATCGTGC AGATGCTGAC CAAGTACAAG GAGCTGCAGG ATAT CATTGC 
GGTGCTTGGT ATCGACGAGC TGAGCGAGGA GGACAAGGTC GTGGTGGACC 
GCGCGCGCAA GGTGACCCGG TTCCTGTCGC AGCCGTTCCA GGTTGCGGAG 

3 0 GTGTTCACGG GCATGACGGG CCACTACGTG CAGCTGGCCG ACACGGTGGA 
GTCGTTCTCT GGGCTGCTGA TGGGGTCGTA CGAC CAGATC CCGGAGA 



35 2) INFORMATION FOR SEQ ID NO: 539 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1250 
1297 



40 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) . TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) 



MOLECULE TYPE: DNA 



45 



(xi) 



SEQUENCE DESCRIPTION: SEQ 



ID NO: 539 



CCITACATCC TBGTYGCICT IAACAAG 



27 



50 



2) INFORMATION FOR SEQ ID NO: 54 0 



55 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) 



MOLECULE TYPE: DNA 



60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 540 
GGDGCITCYT CRTCGWAITC CTG 23 

5 

2) INFORMATION FOR SEQ ID NO: 541 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 541 
GTKGAAATGT TCCGCAAGCT GCT 23 

20 

2) INFORMATION FOR SEQ ID NO: 542 

2 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542 

3 5 CGGAARTAGA ACTGSGGACG GTAG 24 



40 



2) INFORMATION FOR SEQ ID NO: 543 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
4 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 543 

50 

ATCTTAGTAG TTTCTGCTGC TGA 2 3 



55 2) INFORMATION FOR SEQ ID NO: 544 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
AYGTTGTCGC CMGGCATTMC CAT 



2) INFORMATION FOR SEQ ID NO: 545 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
TACATCCTBG TYGCICTIAA CAAGTG 



2) INFORMATION FOR SEQ ID NO: 546 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
CCRCGI CCGG TRATGGTGAA GAT 



2) INFORMATION FOR SEQ ID NO: 54 7 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
. GTAC AGTTG C TTCAGGACGT ATC 



2) INFORMATION FOR SEQ ID NO: 548 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 8 
ACGTTCGATT TCATCACGTT G 



2) INFORMATION FOR SEQ ID NO: 549 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactlae 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 9 
GAACGTGATA CTGACAAACC TTTA 



2 ) INFORMATION FOR SEQ ID NO: 550 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 0 
GAAGAAGAAC ACCAACGTTG 



2) INFORMATION FOR SEQ ID NO: 551 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria gonorrhoeae 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 551 
GAAGAAAAAA TCTTCGAACT GGCTA 



2) INFORMATION FOR SEQ ID NO: 5 52 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria gonorrhoeae 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 552 
TACACGGCCG GTGACTACG 



-2) INFORMATION FOR SEQ ID NO: 553 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 553 
GGCCGTGTTG AACGTGGTCA AATCA 



2) INFORMATION FOR SEQ ID NO: 554 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia trachomatis 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 554 
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GTTCCTTACA TCGTTGTTTT TCTC 



2) INFORMATION FOR SEQ ID NO: 555 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia trachomatis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 555 
TCTCGAACTT TCTCTATGTA TGCA 



2) INFORMATION FOR SEQ ID NO: 556 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 556 
CGGCGCNATC YTSGTTGTTG C 



2) INFORMATION FOR SEQ ID NO: 557 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 557 
CCMAGGCATR ACCATCTCGG TG 



2) INFORMATION FOR SEQ ID NO: 5 58 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 558 
10 TCITTYAART AYGCITGGGT 



2 ) INFORMATION FOR SEQ ID NO: 559 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 559 

25 

CCGACRGCRA YIGTYTGICK CAT 2 3 



30 2) INFORMATION FOR SEQ ID NO: 560 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
4 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 560 

GAYTT CAT YA ARAAYATGAT YAC 23 



45 



55 



60 



2) INFORMATION FOR SEQ ID NO: 561 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 561 
ACIGTICGGC CRCCCTCACG GAT 23 
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2) INFORMATION FOR SEQ ID NO: 562 

(i) SEQUENCE CHARACTERISTICS : 
5 (A) LENGTH: 30 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 562 
CARATGRAYG ARCCICCIGG IGYIMGIATG 3 0 

15 



2) INFORMATION FOR SEQ ID NO: 563 

2 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

25 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 563 

3 0 GGYTGRTAIC CIACIGCIGA IGGCAT 26 



35 



2) INFORMATION FOR SEQ ID NO: 564 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 564 

45 

TAYGGI CARA TGAAYGARCC ICCIGGIAA 2 9 



50 2) INFORMATION FOR SEQ ID NO: 565 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 565 
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15 



20 



55 



GGYTGRTAIC CIACIGCIGA IGGDAT 2 6 



2) INFORMATION FOR SEQ ID NO: 566 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 6 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 566 
TTYGGIGGIG CIGGIGTIGG IAARAC 2 6 

2) INFORMATION FOR SEQ ID NO: 567 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 567 
TCRTCIGCIG G I ACRTAI AY IGCYTG 2 6 

35 

2) INFORMATION FOR SEQ ID NO: 568 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 

. (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 568 
50 RTIATIGGIG CIGTIRTIGA YGT 23 



2) INFORMATION FOR SEQ ID NO: 569 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
60 (D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ 

5 

RTIRTIGGIS CIGTIRTIGA TAT 



10 2) INFORMATION FOR SEQ ID NO: 570 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
2 0 (xi) SEQUENCE DESCRIPTION: SEQ 

RTIRYIGGIC CIGTIRTIGA YGT 



25 

2 ) INFORMATION FOR SEQ ID NO: 571 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

35 

(xi) SEQUENCE DESCRIPTION: SEQ 
RTIRTIGGIC CIGTIRTIGA TGT 

40 

2) INFORMATION FOR SEQ ID NO: 572 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ 
RTIRTIGGIS CIGTIRTIGA 

55 



ID NO: 569 

23 



ID NO: 570 

23 



ID NO: 571 

23 



ID NO: 572 

20 



60 



2) INFORMATION FOR SEQ ID NO: 573 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
CCICCIACCA TRTARAAIGC 



2) INFORMATION FOR SEQ ID NO: 574 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
ATIGCIATGG AYGGIACIGA RGG 



2) INFORMATION FOR SEQ ID NO: 575 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
TIACCATTTC AGTACCTTCT GGTAA 



2) INFORMATION FOR SEQ ID NO: 576 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 



AACTTCRTCA AGAAGGTYGG TTACAA 
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2) INFORMATION FOR SEQ ID NO: 577 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 577 

CATGATTGAA CCATCCACCA 2 0 



20 



30 



35 



60 



2) INFORMATION FOR SEQ ID NO: 578 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubliniensis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 578 
CATGATTGAA GCTTCCACCA 2 0 



40 2) INFORMATION FOR SEQ ID NO: 579 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear" 

(ii) MOLECULE TYPE: DNA 

5 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 79 
55 GAAGGC CGTG CTGGTGAGAA 2 0 



2) INFORMATION FOR SEQ ID NO: 58 0 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Enterococcus faecalis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 580 
GCTAAAC CAG CTACAATCAC TCCAC 



2) INFORMATION FOR SEQ ID NO: 581 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
<B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear . 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus influenzae 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 581 
ACATCGGTGC ATTATTACGT GG 



2) INFORMATION FOR SEQ ID NO: 582 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 582 
TTTCAACTTC GTCGTTGACA CGAACAGT 



2) INFORMATION FOR SEQ ID NO: 583 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 83 
CAACTGCTTT TTGGATATCT TCTTTAATAC CAACG 



2) INFORMATION FOR SEQ ID NO: 584 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 584 
ACATGACACA TCTAAAACAA 



2) INFORMATION FOR SEQ ID NO: 585 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 585 
ACCACATACT GAATTCAAAG 



2) INFORMATION FOR SEQ ID NO: 586 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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10 



30 



40 



45 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 586 
CAGAAGTATA CGTATTATCA 2 0 

2) INFORMATION FOR SEQ ID NO: 587 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

( D ) TOPOLOGY : Linear 

2 0 (ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 587 

CGTATTATCA AAAGACGAAG 20 



2) INFORMATION FOR SEQ ID NO: 588 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 58 8 
TCTTCTCAAA CTATCGTCCA 2 0 



50 2 ) INFORMATION FOR SEQ ID NO : 589 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
60 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Staphylococcus epidermidi 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 589 
GCACGAAACT TCTAAAACAA 



2) INFORMATION FOR SEQ ID NO: 590 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 590 
TATACGTATT ATCTAAAGAT 



2) INFORMATION FOR SEQ ID NO: 591 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 591 
TCCTGGTTCT ATTACACCAC 



2 ) INFORMATION FOR SEQ ID NO: 592 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 592 
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CAAAGCTGAA GTATACGTAT 



2) INFORMATION FOR SEQ ID NO: 593 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epldermidii 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 593 
TTCACTAACT ATCGCCCACA 



2) INFORMATION FOR SEQ ID NO: 594 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 594 
ATTGGTATCC ATGACACTTC 



2 ) INFORMATION FOR SEQ ID NO: 595 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 95 
TTAAAGCAGA CGTATACGTT 
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2) INFORMATION FOR SEQ ID NO: 596 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 596 
GAAATTATTG GTATCAAAGA 



2) INFORMATION FOR SEQ ID NO: 597 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 597 
ATTGGTATCA AAGAAACTTC 



2) INFORMATION FOR SEQ ID NO: 5 98 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 598 
AATTACACCT CACACAAAAT 



2) INFORMATION FOR SEQ ID NO: 599 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 599 
CGGTGAAGAA ATCGAAATCA 



2) INFORMATION FOR SEQ ID NO: 600 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60 0 
ATGCAAGAAG AATCAAGCAA 



2) INFORMATION FOR SEQ ID NO: 601 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 601 
GTTTCACGTG ATGATGTACA 



2) INFORMATION FOR SEQ ID NO: 602 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 602 



AAGTTGAAGT TGTTGGTATT 



2) INFORMATION FOR SEQ ID NO: 603 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 603 
GGTATTAAAG ACGAAACATC 



2) INFORMATION FOR SEQ ID NO: 6 04 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 604 
GGTGATGAAG TAGAAATCGT 



2) INFORMATION FOR SEQ ID NO: 605 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 605 
GAAATGTTCC GTAAATTATT 2 0 

10 2) INFORMATION FOR SEQ ID NO:606 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 606 

ATTAGACTAC GCTGAAGCTG 20 



25 



35 



2) INFORMATION FOR SEQ ID NO: 607 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 821 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: ATCC 2 9212 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 607 

CGGAGCTATC TTAGTAGTTT CTGCTGCTGA TGGTCCTATG CCTCAAACAC 50 

GTGAACATAT CTTATTATCA CGTAACGTTG GTGTACCATA CATCGTTGTA 100 

TTCTTAAACA AAATGGATAT GGTTGATGAC GAAGAATTAT TAGAATTAGT 150 

45 AGAAATGGAA GTTCGTGACT TATTATCAGA ATACGATTTC CCAGGCGATG 2 00 

ATGTTCCAGT TATCGCAGGT TCTGCTTTGA AAGCTTTAGA AGGCGACGAG 250 

TCTTATGAAG AAAAAATCTT AGAATTAATG GCTGCAGTTG ACGAATATAT 3 00 

CCCAACTCCA GAACGTGATA CTGACAAACC ATTCATGATG CCAGTCGAAG 3 50 

ACGTATTCTC AATCACTGGA CGTGGTACTG TTGCTACAGG ACGTGTTGAA 4 00 

50 CGTGGTGAAG TTCGCGTTGG TGACGAAGTT GAAATCGTTG GTATTAAAGA 450 

CGAAACATCT AAAACAACTG TTACAGGTGT TGAAATGTTC CGTAAATTAT 500 

TAGACTACGC TGAAGCAGGC GACAACTTCG GTGCTTTATT ACGTGGTGTA 550 

GCACGTGAAG ATATCGAACG TGGACAAGTA TTAGCTAAAC CAGCTACAAT 60 0 

CACTCCACAC ACAAAATTCA AAGCTGAAGT ATACGTATTA TCAAAAGAAG 650 

55 AAGGCGGACG TCACACTCCA TTCTTCACTA ACTACCGTCC TCAATTCTAC 7 00 

TTCCGTACAA CAGACGTTAC TGGTGTTGTA GAATTGC CAG AAGGTACTGA 750 

AATGGTAATG CCTGGTGATA ACGTTGCTAT GGACGTTGAA TTAATTCACC 800 

CAATCGCTAT CGAAGACGGA A 821 

60 
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2) INFORMATION FOR SEQ ID NO: 608 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: ATCC 19434 



15 



35 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 608 



CGGAGCTATC TTGGTAGTTT CTGCTGCTGA CGGCCCAATG CCTCAAACTC 50 

GTGAACACAT CCTATTGTCT CGTCAAGTTG GTGTT C CTTA CATCGTTGTA 10 0 

2 0 TTCTTGAACA AAGTAGACAT GGTTGATGAC GAAGAATTAC TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACC TATTAACAGA ATACRAATTC CCTGGTGRCG 20 0 

ATGTTCCTGT AGTTGCTGGA TCAGCTTTGA AAGCTCTAGA AGGCGACGCT 250 

TCATACGAAG AAAAAATTCT TGAATTAATG GCTGCAGTTG ACGAATACAT 300 

CCCAACTCCA GAACGTGACA ACGACAAACC ATT C ATGATG CCAGTTGAAG 35 0 

2 5 ACGTGTTCTC AATTACTGGA CGTGGTACTG TTGCTACAGG TCGTGTTGAA 4 00 

CGTGGACAAG TTCGCGTTGG TGACGAAGTT GAAGTTGTTG GTATTG CTGA 45 0 

AGAAACTTCA AAAACAACAG TTACTGGTGT TGAAATGTTC CGTAAATTGT 500 

TAGACYACGC TGAAGCTGGA GACRACATTG GTGCTTTACT ACGTGGTGTT 55 0 

GCACGTGAAG ACATCCAACG TGGACAAGTT TTAGC TAAAC CAGGTACAAT 60 0 

3 0 • CACACCTCRT ACAAAATTCT CTGCAGAAGT ATACGTGTTG ACAAAAGAAG 65 0 

AAGGTGGACG TCATACTC CA TTCTTCACTA ACTACCGTCC ACAATTCTAC 70 0 

TTCCGTACAA CTGACGTAAC AGGTGTTGTT GAATTACCAG AAGGAACTGA 750 

A 751 



2) INFORMATION FOR SEQ ID NO: 609 



(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 751 bases 

( B ) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: ATCC 49573 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 609 



CGGTGCGATC TTAGTAGTAT CTGCTGCTGA CGGTCCTATG CCTCAAACTC 50 

GTGAACACAT CTTGTTATCA CGTAACGTTG GCGWAC CATA CATCGTTGTT 100 

55 TTCTTGAACA AAATGGATAT GGTTGAYGAC GAAGAATTGC TAGAATTAGT 15 0 

TGAAATGGAA GTTCGTGACC TATTGTCTGA ATATGACTTC CCAGGCGACG 2 00 

ATGTTCCTGT AATCGCCGGT TCTGCTTTGA AAGCTCTTGA AGGAGATCCT 25 0 

TCATACGAAG AAAAAATCAT GGAATTGATG GCTGCAGTTG ACGAATACGT 3 00 

TCCAACTCCA GAACGTGATA CTGACAAACC ATTCATGATG CCAGTCGAAG 3 50 

60 ACGTATTCTC AATCACTGGA CGTGGTACTG TTGCTACAGG CCGTGTTGAA 4 00 
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10 



CGTGGACAAG TTCGCGTTGG TGATGAAGTA GAAATCGTTG GTATTGCTGA 4 50 

CGAAACTGCT AAAACAACTG TAACAGGTGT TGAAATGTTC CGTAAATTGT 50 0 

TAGAC TATGC TGAAGCAGGG GATAACATTG GTGCATTGCT ACGTGGGGTT 550 

GCTCGTGAAG ACATCCAACG TGGACAAGTA TTGGCTAAAG CTGGTACRAT 600 

CACACCTCAT ACAAAATT C A AAGC TGAAGT TTATGTTTTG ACAAAAGAAG 650 

AAGGTGGRCG TCACACTCCA TTCTTCACTA ACTACCGTCC TCAGTTCTAC 7 00 

TTCCGTACAA CTGACGTAAC TGGTGTTGTT GAATTACCAG AAGGAACTGA 750 



A 



2) INFORMATION FOR SEQ ID NO: 610 



751 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

2 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus Influenzae 

(B) STRAIN: Rd 

25 (C) ACCESSION NUMBER: extracted from U32648 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 610 

AATATGATTA CTGGTGCGGC ACAAATGGAT GGTGCTATTT TAGTAGTAGC 50 

30 AGCAACAGAT GGTCCTATGC CACAAACTCG TGAACACATC TTATTAGGT C 10 0 

GCCAAGTAGG TGTTCCATAC ATCATCGTAT TCTTAAACAA ATGCGACATG 15 0 

GTAGATGACG AAGAGTTATT AGAATTAGTC GAAATGGAAG TTCGTGAACT 2 00 

TCTATCTCAA TATGACTTCC CAGGTGACGA TACAC CAATC GTACGTGGTT 250 

CAGCATTACA AGCGTTAAAC GGCGTAGCAG AATGGGAAGA AAAAATCCTT 3 00 

3 5 GAGTTAGCAA ACCACTTAGA TACTTACAT C CCAGAACCAG AACGTGCGAT 3 50 

TGACCAACCG TTCCTTCTTC CAATCGAAGA TGTGTTCTCA ATCTCAGGTC 4 00 

GTGGTACTGT AGTAACAGGT CGTGTAGAAC GAGGTATTAT CCGTACAGGT 4 50 

GATGAAGTAG AAATCGTCGG TATCAAAGAT AC AGC GAAAA CTACTGTAAC 50 0 

GGGTGTTGAA ATGTTCCGTA AATTACTTGA CGAAGGTCGT GCAGGTGAAA 550 

40 ACATCGGTGC ATTATTACGT GGTAC CAAAC GTGAAGAAAT CGAACGTGGT 6 00 

CAAGTATTAG CGAAACCAGG TTCAATCACA CCACACACTG ACTTCGAATC 650 

AGAAGTGTAC GTATTATCAA AAGATGAAGG TGGTCGTCAT ACTCCATTCT 70 0 

TCAAAGGTTA CCGTCCACAA TTCTATTTCC GTACAACAGA CGTGAC TGGT 750 

ACAATCGAAT TACCAGAAGG CGTGGAAATG GTAATGCCAG GCGATAACAT 8 00 

4 5 CAAGATGACA GTAAGCTTAA TCCACCCAAT TGCGATGGAT CAAGGTTTAC 8 50 

GTTTCGCAAT CCGTGAAGGT GGC CGTACAG TAGGTGCAGG C 8 91 



50 2) INFORMATION FOR SEQ ID NO: 611 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
6 0 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Staphylococcus epidermidls 

(B) STRAIN: ATCC 14990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 611 

5 

CGGCGGTATC TTAGTTGTAT CTGCTGCTGA CGGTCCAATG CCACAAACTC 50 

GTGAACACAT CTTATTATCA CGTAACGTTG GTGTACCAGC ATTAGTTGTA 10 0 

TTCTTAAACA AAGTTGACAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGACG 200 

10 ATGTACCTGT AATCGCTGGT TCTGCATTAA AAGCATTAGA AGGCGATGCT 250 

GAATACGAAC AAAAAATCTT AGACTTAATG CAAGCAGTTG ATGATTACAT 3 00 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 4 00 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATC AT CG GTATGGACGA 450 

15 AACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAATTATTAG 500 

ACTACGCTGA AGCTGGTGAC AACATCGGTG CTTTATTACG TGGTGTTGCA 550 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCTATTAC 60 0 

ACCACACACA AAATTCAAAG CTGAAGTATA CGTATTATCT AAAGATGAAG 650 

GTGGACGTCA CACTCCATTC TTCACTAACT ATCGCCCACA ATTCTATTTC 70 0 

2 0 CGTACTACTG ACGTAACTGG TGTTGTAAAC TTACCAGAAG GTACAGAAAT 750 

GGTTATGCCT GGCGACAACG TTGAAATGAC AGTTGAATTA ATCGCTCCAA 800 

TCGCTATCGA AGACGGAA 818 



25 



35 



40 



2) INFORMATION FOR SEQ ID NO: 612 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 825 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype parathyphi A 

(B) STRAIN: ATCC 9150 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 612 



GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

45 TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGACTTCC CGGGCGACGA 2 00 

CACGCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 250 

AGTGGGAAGC GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATC 3 00 

CCGGAACCAG AG CGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 350 

50 CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC 400 

GCGGTAT CAT CAAAGTGGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAG 450 

ACTCAGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AAC TGCTGGA 500 

CGAAGGCCGT GCCGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CAC CAT C AAG 60 0 

55 CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG 65 0 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 70 0 

GTACTACTGA CGTGACTGGC ACCATCGAAC TGC CGGAAGG CGTAGAGATG 75 0 

GTAATGC CGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCAATGGAC GACGGTCTGC GTTTC 82 5 

60 
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10 



2) INFORMATION FOR SEQ ID NO: 613 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia ficaria 
15 (B) STRAIN: ATCC 33105 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 613 

GGCGCTATCC TGGTTGTTGC TGCGACTGAC GGCCCAATGC CTCAGACCCG 5 0 

2 0 TGAGCACATC CTGCTGGGTC GYCAGGTTGG CGTTCCTTTC ATCATCGTRT 10 0 

TCATGAACAA ATGCGACATG GTTGATGATG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TTCGCGAACT GCTGTCCGCT TACGACTTCC CTGGCGATGA 200 

CCTGCCGGTG ATTCGCGGTT CCGCGCTGAA AGCGCTGGAA GGCGAAGCCG 250 

AGTGGGAAGC TAAAATCATC GAGCTGGCTG AMCMS CTGGA TACTTACATC 3 00 

25 CCAGAACCAG AGCGCGCTAT CGACAAGCCG TTCCTGCTGC CAATCGAAGA 3 50 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT GGTTACCGGT CGTGTTGAGC 4 00 

GCGGTATCAT CAAAGTTGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAC 450 

ACCGTCAAGT CTACCTGTAC CGG CGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCCGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAGC 550 

3 0 GTGAAGACAT CGAACGTGGT CAGGTTCTGG CTAAACCAGG TTCCATCAAG 600 

CCGCACACCC AGTTCGATTC AGAAGTGTAC ATCCTGAGCA AAGAAGAAGG 650 

TGGTCGTCAC ACKCCATTCT TC AAAGGC TA CCGTCCACAG TTCTACTTCC 700 

GTACTACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTTGAGATG 750 

GTAATGCCTG GCGACAACGT GAACATGA 77 8 

35 



2) INFORMATION FOR SEQ ID NO: 614 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 653 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus malodoratus 
50 (B) STRAIN: ATCC 43197 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 614 

GTGCGATCTT AGTAGTATCA GCTACTGATG GTCCAATGCC TCAAACTCGT 50 

55 GAACACATTT TGTTATCACG TCAAGTTGGT GTTAAGCACT TGATCGTTTT 100 

CTTGAACAAA GTAGATTTAG TTGATGACGA AGAATTGATC GACTTAGTTG 150 

AAATGGAAGT ACGTGAATTA CTTTCTGAAT ATGGTTTCCC AGGTGATGAT 200 

- ATTCCAGTGC TTAAAGGTTC TGCTTTGAAA GCATTAGAAG GCGATCCAGA 250 

ACAAGAACAA GTTATTCTTG ATTTGATGGA TAC CGTTGAT GAATATATCC 3 00 

60 CAACACCTGA ACGTGACAAT GACAAACCGT TCTTGTTACC AGTTGAGGAT 3 50 
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GTTTTCTCGA TCACAGGACG 
TGGCGAAGTT AAAGTCGGCG 
AAGTTCAAAA AGCAATCGTT 
GATTATGGTG AAGCTGGCGA 
5 ACGTGATGAA ATCGAACGTG 
CACCACATAC TAAGTTCAAA 
GGT 



TGGTACTGTA GCTTCTGGTC 
ATGAAATTGA AATCATCGGG 
ACTGGACTTG AAATGTTCCG 
TAACGTTGGG GTTCTATTAC 
GCCAAGTATT AGCTAAACCA 
GCCGAAGTAT ATGTGTTGAC 



GTATCGACCG 4 00 

ATCAAACCTG 450 

TAAAACATTG 500 

GTGGGATTAC 550 

GGTTCAATCA 600 

GAAAGAAGAA 65 0 
653 



10 



20 



45 



2) INFORMATION FOR SEQ ID NO: 615 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 669 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: En terococcus durans 

(B) STRAIN: ATCC 19432 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 615 

GGAGCCATTC TAGTTGTATC TGCAACAGAT GGACCAATGC CACAAACACG 5 0 

TGAACATATT TTATTGTCAC GTCAAGTAGG TGTTAAATAT TTGATCGTCT 100 

TCTTGAACAA AATCGACTTA GTAGATGATG AAGAATTGAT TGATCTTGTC 150 

3 0 GAAATGGAAG TTCGTGAATT ATTAAGCGAA TATGGTTTCC CAGGTGACGA 2 00 

TACACCAGTC ATCAAAGGTT CAGCATTAAA AGCTTTACAA GGAGATCCTG 250 

ATGCAGAAGC AGCTATCATG GAATTGATGG ATACTGTTGA TGAATATATC 3 00 

CCAACACCAG AACGTGATAC AGACAAACCA TTATTGTTAC CAGTGGAAGA 3 50 

TGTCTTCTCA ATCACAGGTC GTGGGACTGT TGCTTCAGGT CGTATCGATC 4 00 

3 5 GTGGTGCAGT TCGTGTAGGT GATGAAATCG AAATCGTCGG TATCAAACCT 450 

GAAACACAAA AAGCTGTTGT AACTGGGGTC GAAATGTTCC GCAAGACATT 50 0 

AGACTATGGT GAAGCAGGAG ATAACGTTGG GGTATTGTTA CGTGGTATCC 550 

AACGTGAAGA TATCGAACGT GGACAAGTAA TCGCAAAACC AGGTTCAATC 60 0 

ACACCACATA CAAAATTCAA AGCAGAAGTG TACGTATTGA CAAAAGAAGA 650 

40 AGGTGGACGT CATACACCA 66 9 



2) INFORMATION FOR SEQ ID NO: 616 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 669 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Enterococcus pseudoavium 

(B) STRAIN: ATCC 49372 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 616 

60 GGTGCAATTT TAGTAGTATC TGCTACTGAT GGCCCAATGC CACAAACACG . 50 
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TGAACATATC TTGTTATCAC GTCAAGTAGG GGTTAAACAC TTAATCGTCT 100 

TCTTGAACAA AGTTGATTTA GTTGATGATG AAGAATTGAT CGATTTAGTT 150 

GAAATGGAAG TTCGGGAATT GCTTTCTGAA TATGGTTTCC CAGGCGATGA 2 00 

TATTCCAGTA CTTAAAGGTT CTGCTTTGAA AGCTTTAGAA GGCGATCCTG 250 

5 AACAAGAACA AGTAATC CTT GACTTGATGG ATACGGTTGA TGAATACATC 3 00 

CCAACGCCTG AAC GTGATAC TGACAAAC CA TTCTTGTTAC CAGTCGAAGA 3 50 

TGTCTTCTCA ATCACAGGAC GTGGTACGGT TGCATCTGGT CGTATCGATC 4 00 

GTGGGGAAGT TAAAGTCGGT GATGAAGTTG AAATCATCGG GATCAAACCT 4 50 

GAAGTGCAAA AAGCTGTCGT AACTGGACTA GAAATGTTCC GTAAGACATT 500 

10 GGATTACGGT GAAGCTGGCG ATAACGTTGG GGTTCTATTA CGTGGGATTA 550 

CTCGTGATGA AATCGAACGT GGACAAGTAT TAGCTAAACC AGGTTCAATC 60 0 

ACTCCACATA CGAAATTCAG TGCAGAAGTT TATGTATTGA CGAAAGAAGA 650 

AGGTGGCCGT CATACGCCA 669 



15 



30 



2) INFORMATION FOR SEQ ID NO: 617 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 668 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

2 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus dispar 

(B) STRAIN: ATCC 51266 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 617 



GGGGCAATTT TAGTTGTATC TGCAACTGAT GGCCCAATGC CACAAACACG 50 

TGAACACATT TTGTTAGCTC GTCAAGTAGG GGTTAAATAT TTAATCGTCT 10 0 

3 5 TCTTGAACAA AACAGATTTA GTTGATGATG AAGAATTATT GGAACTAGTT 150 

GAAATGGAAG TTCGTGAATT ATTAAATGAA TACAATTTCC CTGGCGATGA 2 00 

TATTCCTGTT ATTCGCGGAT CTGCTTTAAA AGCATTAGAA GGCGATCCAG 2 50 

AACAAGAAGA AGTAATTATG AACTTGATGG ATACTGTGGA TGAATATATC 300 

CCAACTCCAG AACGTGACAA TGATAAAC C A TTCTTGTTAC CAGTGGAAGA 350 

4 0 TGTCTTCACA ATTACTGGTC GTGGTACTGT TGCTTCAGGT CGTATCGACC 400 

GTGGTAAAGT CAACGTTGGT GATGAAATTG AAATTATCGG AATTAAAC CA 45 0 

GAAACACAAA AAGCTGTTGT AACCGGTTTG GAAATGTTCC GTAAAACTTT 500 

GGATTATGGT GAAGCTGGTG ATAACGTTGG GGTCTTATTA CGTGGGATTA 55 0 

CTCGTGATGA AGTAGAACGT GGTCAAGTAT TAGCAAAACC AGGTTCCATT 60 0 

45 ACAC CGCATA CCAAATTTAA AGGTGAAGTT TATATCTTAA CAAAAGAAGA 650 

AGGTGGACGT CATACTCC 668 



50 2) INFORMATION FOR SEQ ID NO: 618 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 673 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM : Enterococcus avium 

(B) STRAIN: ATCC 14025 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 618 

5 

GGTGCAATCC TAGTAGTATC AGCTACTGAT GGTCCAATGC CGCAAACACG 5 0 

TGAACATATT TTGCTATCAC GGCAAGTGGG TGTTAAACAC TTAATCGTAT 100 

TTTTAAACAA AGTTGATTTA GTCGATGATG AAGAATTGAT CGATCTAGTT 150 

GAAATGGAAG TCCGTGAATT ACTTTCTGAA TATGGTTTCC CAGGTGACGA 2 00 

10 TATTC CAGTT CTCAAAGGTT CAGCTTTGAA AGCATTAGAA GGCGATCCTG 2 50 

AACAAGAACA AGTAATCCTT GATTTAATGG ATACAGTTGA CGAATATATC 3 00 

CCAACTCCAG AACGTGACAC TGACAAGCCA TTCTTGTTAC CAGTCGAAGA 3 50 

TGTATTTTCT ATCACTGGTC GTGGGACTGT AGCGTCTGGA CGGATTGATC 4 00 

GTGGTGAAGT TAAAGTCGGC GATGAAGTTG AAATCATCGG GATCAAACCT 450 

15 GAAATTCAAA AAGCAGTCGT AACTGGACTT GAAATGTTCC GTAAAACTTT 50 0 

AGATTATGGT GAAGCTGGCG ATAACGTTGG GGTTCTATTA CGTGGGATTA 550 

CACGTGATGA AATCGAACGT GGTCAAGTCT TAGCTAAACC AGGTTCAATC 600 

ACACCACATA CAAAATTCAG TGCAGAAGTT TACGTATTGA CGAAAGAAGA 650 

AGGTGGACGT CATACACCAT CTT 6 73 

20 

2) INFORMATION FOR SEQ ID NO: 619 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1713 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE -TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 
35 (C) ACCESSION NUMBER: K00428 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 619 

TTT CATATTT TTAAGGATTT TGTTTTAGCA CCCATCCGAC CTCAGTCAAT 50 

4 0 ATATCCTTTC GCGAC CAGGC TTTCCTCCCT TTTGCTGCTA ACTGGTTACA 100 

GATTTTCCTA TTTTTGGTCA TTTTTATCTT TGAAACTGAT TAAGCTGAAA 15 0 

AAATTTGAGC TTCTTTGTTG TAAACTATTT TGTGCTTTCA GTTTTATTCT 2 00 

AGCTCGACAA AGGTAACAGA CAAAAATGTC AGC TTTATTA CCAAGATTAC 250 

TCACAAGAAC AGCTTTTAAA GCTTCTGGGA AACTT CTGAG GCTCTCTTCA 3 00 

45 GTAATTTCTA GGAC CTTTTC TCAAACTACT ACTTCCTATG CAGCTGCTTT 350 

TGATCGTTCC AAACCGCATG TAAATATAGG TACGAT CGGC CATGTTGATC 4 00 

ATGGGAAGAC AACTTTAACC GCAGCCATTA CGAAAACGTT AGCCGCAAAA 45 0 

GGTGGTGCCA ACTTCTTGGA CTATGCTGCC ATCGATAAGG CTC CGGAAGA 500 

AAGAGCTCGT GGTATTACAA TTTCTACTGC ACACGTGGAA TACGAAACGG 550 

50 CCAAGAGACA TTATTCTCAC GTCGACTGTC CAGGCCACGC TGATTACATC 600 

AAGAATATGA TTACCGGTGC TGCTCAAATG GATGGTGCTA TCATTGTTGT 650 

AGCTGCTACC GATGGACAAA TGCCCCAAAC TAGAGAACAT TTACTTTTGG 7 00 

CCAGACAAGT TGGTGTCCAA CATATTGTCG TTTTTGTTAA CAAGGTTGAT 7 50 

ACCATTGATG ATCCAGAAAT GTTAGAGTTA GTCGAAATGG AAATGAGAGA 800 

55 ACTTTTAAAC GAATATGGGT TTGACGGTGA TAATGCTC CA ATTATCATGG 8 50 

GTTCTGCCCT TTGCGCTTTG GAAGGTCGCC AACCTGAAAT TGGGGAGCAG 900 

GCCATCATGA AACTTTTGGA TGCAGTGGAT GAGTATATTC CTACAC CTGA 950 

AAGAGATTTG AACAAG C CTT TCTTGATGCC CGTTGAAGAT ATCTTCTCTA 1000 

TCTCCGGTAG AGGTACTGTG GTCACTGGTC GTGTGGAAAG GGGTAATTTA 1050 

60 AAGAAAGGTG AGGAATTGGA AATTGTTGGT CACAACTCCA CCCCATTGAA 1100 
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AACAACAGTT ACTGGTATTG AAATGTTTAG AAAGGAATTG GACTCTGCTA 115 0 

TGGCAGGTGA CAATGCCGGT GTTTTACTTA GAGGT AT GAG GAGAGATCAA 12 0 0 

TTGAAGAGAG GTATGGTCTT AGCTAAGCCA GGTACCGTTA AAGCCCATAC 1250 

AAAGATTCTA GCCTCTTTGT ACATTTTATC CAAAGAGGAA GGTGGTAGAC 13 00 

5 ATTCTGGGTT TGGTGAAAAC TACAGACCAC AAATGTTTAT AAGAACAGCT 1350 

GATGT TAC AG TTGTGATGAG ATTTCCTAAG GAGGTTGAAG ATCATTCTAT 140 0 

GCAAGTTATG CCAGGTGACA ATGTTGAAAT GGAATGTGAT TTGATCCATC 1450 

CTACCCCATT AGAAGTTGGT CAACGTTTCA ATATCAGAGA GGGTGGAAGA 1500 

ACTGTTGGTA CCGGTCTAAT CACACGTATT ATTGAATAGA CTTATTGATG 1550 

10 CAACTGGAGT ATATTTCTAT ATATTCTGTT CATTTCCCCT CTCATAATAT 1600 

ATACTTGTTT CGTTAAAATT TTATACGTGT AAATAAAGTG CCATAAATTT 165 0 

TTCAGCTTTA CTTTTGGTAG AGTCCTGCTA GCACTAGATT TTACAATTTC 1700 

ATGTGCACAC ACC 1713 



15 



40 



2) INFORMATION FOR SEQ ID NO: 620 



( i ) SEQUENCE CHARACTERISTICS : 
2 0 (A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 

2 5 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 620 
ATTGGTGCAT TGCTACGT 1 8 

30 

2) INFORMATION FOR SEQ ID NO: 621 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
45 (B) STRAIN: ATCC 19434 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 621 

TGGTGCAATC TTAGTTGTTT CTGCAACTGA CGGTCCGATG CCTCAAACAC 50 

50 GTGAACACAT TTTATTGTCA CGC CAAGTTG GTGTAAAATA CCTGATTGTT 100 

TTCTTGAACA AAGTTGATTT AGTCGATGAT GAAGAATTGA TCGATTTGGT 150 

AGAAATGGAA GTTCGCGAGT TATTGAGCGA ATATGGTTTC CCAGGCGATG 2 00 

ACACTCCTGT GATCAAAGGT TCCGCATTAA AAGCATTGCA AGGCGATCCA 250 

GATGCTGAAG CTGCTATTAT GGAATTGATG GATACAGTAG ATGAATATAT 3 00 

55 CCCAACACCA GAACGTGATA CAGATAAACC ATTACTATTG CCAGTGGAAG 350 

ACGTCTTCTC AATTACAGGT CGAGGAACTG TTGCCTCAGG TCGTATTGAT 400 

CGTGGTGCTG TTCGTGTCGG TGATGAGGTA GAGATCGTAG GGATCAAACC 450 

TGAAACACAA AAAGCAGTTG TAACAGGTGT AGAAATGTTC CGTAAAACGT 500 

TAGATTACGG GGAAGCTGGG GATAACGTAG GCGTGTTGTT ACGGGGGATC 550 

60 CAACGTGACG ATATCGAACG TGGACAAGTA CTTGCTAAAC CAGGTTCCAT 600 
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TACTCCACAT ACAAAATTCA AAGCAGAAGT 
AAGGTGGACG TCATACTCCA TTCTTCAACA 
TTCCGCACAA CTGATGTTAC AGGAACAATC 
A 



GTACGTGTTG ACAAAAGAAG 650 
ACTATCGTCC ACAGTTCTAC 7 00 

ACATTGCCAG AAGATACAGA 750 

751 



2) INFORMATION FOR SEQ ID NO: 622 



10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sac char owyces cerevlsiae 
20 (B) STRAIN: ATCC 13264 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 622 



GTCAAATGGG ACGAATC CAG ATTC CAAGAA ATTGTCAAGG AAACCTCCAA 50 

25 CTTTATCAAG AAGGTTGGTT ACAACCCAAA GACTGTTCCA TTCGTCCCAA 100 

TCTCTGGTTG GAAC GGTGAC AACATGATTG AAGCTACCAC CAACGCTCCA 150 

TGGTACAAGG GTTGGGAAAA GGAAACCAAG GCCGGTGTCG TCAAGGGTAA 2 00 

GACTTTGTTG GAAGCCATTG ACGCCATTGA ACAAC CATCT AGACCAACTG 250 

ACAAGCCATT GAGATTGCCA TTGCAAGATG TTTACAAGAT TGGTGGTATT 3 00 

3 0 GGTACTGTGC CAGTCGGTAG AGTTGAAACC GGTGTCATCA AGCCAGGTAT 3 50 

GGTTGTTACT TTCGCCCCAG CTGGTGTTAC CACTGAAGTC AAGTCCGTTG 4 00 

AAATGCATCA CGAACAATTG GAACAAGGTG TTCCAGGTGA CAACGTTGGT 4 50 

TTCAACGTCA AGAACGTTTC CGTTAAGGAA ATCAGAAGAG GTAACGTCTG 500 

TGGTGACGCT AAGAACGATC CACCAAAGGG TTGCGCTTCT TTCAACGCTA 550 

3 5 CCGTCATTGT TTTGAACCAT CCAGGTCAAA TCTCTGCTGG TTACTCTCCA 60 0 

GTTTTGGATT GTCACACTGC TCACATTGCT TGTAGATTCG ACGAATTGTT 650 

GGAAAAGAAC GACAGAAGAT CTGGTAAGAA GTTGGAAGAC CATCCAAAGT 700 

TCTTGAAGTC CGGTGACGCT GCTTTGGTCA AGTTCGTTCC ATCTAAGCCA 750 

40 



2) INFORMATION FOR SEQ ID NO: 623 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 1269 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus neoformans 

(B) STRAIN: ATCC 44104 

55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 3 



TCTTGAAAGC TTAAGGCCGA GCGAGAGCGA GGTATCACCA TCGACATTGC 50 
TCTTTGGAAG TTCGAGACCC CCAGGTACCA GGTCACCGTC ATTGACGCCC 100 
60 CCGGTCACCG AGACTTCATC AAGAACATGA TCACCGGTAC CTCCCAGGCT 150 
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GACTGTGCCA TCCTCATCAT TGCCACCGGT ATCGGTGAGT TCGAGGCCGG 2 00 

TATCTCCAAG GACGGTCAGA CCCGAGAGCA CGCCCTCCTC GCCTTCACCC 250 
TCGGTGTCAG GCAGCTCATT GTTGCTTGCA ACAAGATGGA CACCTGCAAG 3 00 

TGGTCCGAGG ACCGATTCAA CGAAATCGTC AAGGAGACCA ACGGTTTCAT 3 50 

5 CAAGAAGGTT GGTTACAACC CCAAGGCTGT CCCCTTCGTC CCCATCTCTG 4 00 

GTTGGCACGG TGACAACATG TTGGAGGAGA CCACCAAGTC AGTGAATCCG 45 0 

CTTCTACGTG ATGAGATGTT TTTCTGACTT TCCGTGCAGC ATGCCCTGGT 500 
ACAAGGGATG GAC CAAGGAG ACCAAGTCTG GTGTTTCCAA GGGTAAGACC 550 
CTCCTCGAGG CCATCGACGC CATCGAGCCC CCTACCCGAC CCACCGACAA 600 

10 GCCCCTCCGT CTCCCTCTCC AGGACGT CTA CAAGATCGGT GGTATCGGCA 650 
CAGTCCCTGT CGGCCGAGTC GAGACCGGTG TCATCAAGGC CGGTATGTTG 700 
TCTCATCTCT CTTGTCTCGT AACATGCGTC TCGTAACATG CGCTTACTTC 7 50 

ATTTTCAGGT ATGGTCGTCA AGTTCGCCCC CACCAACGTC ACCACTGAAG 8 00 

TCAAGTCCGT TGAGATGCAC CACGAGCAGA TCCCCGAGGG TCTTCCCGGA 8 50 

15 GACAAC GTTG GTTTCAACGT CAAGAACGTT TCCATCAAGG ACATC CGACG 900 
AGGTAACGTC TGTGGTGACT CCAAGAACGA CCCCCCTATG GAGGCTGCTT 950 

CTTTCAACGC CCAGGTTATC GTCCTTAACC ACCCTGGTCA GATCGGTGCC 1000 

GGTTACACCC CCGTTCTCGA CTGTCACACT GCCCGTAAGC CTGACCCAAT 10 50 

ACCTCCAACA TACCTTTGAA GCTGACCCTT TCTAGACATT GCCTGCAAGT 1100 

2 0 TTGCTGAGTT GATCGAGAAG ATTGACCGAC GAAC CGGTAA GGTCATGGAG 1150 

GCCGCCCCCA AGTTCGTCAA GTCTGGTGAC GCCGCCATTG TCAAG CTTGT 12 00 

TGCCCAGAAG CCCCTCTGTG TTGAGAC CTA CGCCGACTAC CCCCCTCTTG 1250 

GTCGATTCGC CGTCCGAGA 12 69 



25 



2) INFORMATION FOR SEQ ID NO: 624 



(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 753 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 36801 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 624 



TCTGTCAAAT GGGACAAAAA CAGATTTGAA GAAATCATCA AGGAAACCTC 50 

CAACTTCGTC AAGAAGGTTG GTTACAACCC AAAGACTGTT CCATTCGTTC 100 

45 CAATCTCTGG TTGGAATGGT GACAACATGA TTGAACCATC CAC CAACTGT 150 

CCATGGTACA AGGGTTGGGA AAAGGAAACC AAATCCGGTA AAGTTACTGG 2 00 

TAAGAC CTTG TTAGAAGCTA TTGACGCTAT TGAACCACCA ACCAGAC CAA 2 50 

CCGACAAACC ATTGAGATTG CCATTGCAAG ATGTTTACAA GATCGGTGGT 3 00 

ATTGGTACTG TGCCAGTCGG TAGAGTTGAA ACTGGTATCA TCAAAGCCGG 3 50 

50 TATGGTTGTT ACTTTCGCCC CAGCTGGTGT TACCACTGAA GTCAAATCCG 4 00 

TTGAAATGCA TCACGAACAA TTGGCTGAAG GTGTTCCAGG TGACAATGTT 450 

GGTTTCAACG TTAAGAACGT TTCCGTTAAA GAAATTAGAA GAGGTAACGT 500 

TTGTGGTGAC TCCAAGAACG ATCCACCAAA GGGTTGTGAC TCTTTCAATG 550 

CCCAAGTCAT TGTTTTGAAC CATCCAGGTC AAATCTCTGC TGGTTACTCT 600 

55 CCAGTCTTGG ATTGTCACCC TGCCCACATT GCTTGTAAAT TCGACACTTT 650 

GGTTGAAAAG ATTGACAGAA GAACTGGTAA GAAATTGGAA GAAAATCCAA 700 

AATTCGTCAA ATCCGGTGAT GCTGCTATCG TCAAGATGGT CCCAACCAAA 750 

CCA 753 

60 
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2) INFORMATION FOR SEQ ID NO: 625 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 625 

CGTTGAAGAC ACGACCCAAA GTATCC 2 6 



20 



30 



35 



60 



2) INFORMATION FOR SEQ ID NO: 626 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 6 
TACCACCTTT TAAGTAAGGT GCTAAT 2 6 



4 0 2) INFORMATION FOR SEQ ID NO: 62 7 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 627 
55 ATTGTCTATA AAAATGGCGA TAAGTC 2 6 



2) INFORMATION FOR SEQ ID NO: 628 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Streptococcus agalactiae 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 8 
AAAATGGCGA TAAGTCACAA AAAGTA 2 6 



15 



30 



35 



55 



2) INFORMATION FOR SEQ ID NO: 629 



20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia pneumoniae 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 629 
AAGTTCCATC TCAACAAGGT CAATA 2 5 

2) INFORMATION FOR SEQ ID NO: 63 0 

(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
(C> STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 

4 5 (ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia pneumoniae 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 0 

CGGAGCTATC CTAGTCGTTT CA 22 



2) INFORMATION FOR SEQ ID NO: 631 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
60 (B) TYPE: Nucleic acid 
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10 



45 



(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 631 
CAGACCAACY GAIAARCCAT TRAGAT 2 6 

2) INFORMATION FOR SEQ ID NO: 632 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 23 bases 

<B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

2 0 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 632 
CCCTTTGGTG GRTCS TKCTT GGA 23 

25 

2) INFORMATION FOR SEQ ID NO: 633 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 633 
40 CAGACCAACY GAIAARCCIT TRAGAT 2 6 



2) INFORMATION FOR SEQ ID NO: 634 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 634 

55 

AACACYGTCA GRRC I ATTGC YATGGA 2 6 



60 2) INFORMATION FOR SEQ ID NO: 63 5 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 635 

AAAC CRGT I A RRGCRACTCT IGCTCT 26 



15 



25 



30 



2) INFORMATION FOR SEQ ID NO: 63 6 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 636 

AC TGG YGTTG AIATGTTCCG YAA 23 

2) INFORMATION FOR SEQ ID NO: 63 7 



(i) SEQUENCE CHARACTERISTICS: 

3 5 (A) LENGTH: 2 3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

4 0 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 7 
ACGTCAGT I G TACGGAARTA GAA 2 3 

45 

2) INFORMATION FOR SEQ ID NO: 638 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 638 
6 0 CCAATGCCAC AAACI CGTGA RCACAT 26 
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2) INFORMATION FOR SEQ ID NO: 639 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 
TTTACGGAAC ATTTCWACAC CWGTIACA 



2 ) INFORMATION FOR SEQ ID NO: 640 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 
TCCATGGT I T WYGG I CARAT GAA 



2) INFORMATION FOR SEQ ID NO: 641 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 
TGATAACCWA CIGCIGAIGG CATACG 



2) INFORMATION FOR SEQ ID NO: 642 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNES S : S ingl e 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 642 
GGCGTIGGIG ARCGIACICG TGA 23 

5 

2) INFORMATION FOR SEQ ID NO: 643 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 643 
2 0 ACTGGIGTIG ARATGTTCCG YAA 23 



25 



55 



2) INFORMATION FOR SEQ ID NO: 644 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 644 

35 

ACGTCIGTIG TICKGAARTA GAA 2 3 



40 2) INFORMATION FOR SEQ ID NO: 645 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 645 

ACGTCIGTIG TICKGAARTA RAA 2 3 



2) INFORMATION FOR SEQ ID NO: 64 6 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
60 (B) TYPE: Nucleic acid 
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10 



45 



(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 646 
ATCGACAAGC CITTCYTIAT GSC 2 3 

2) INFORMATION FOR SEQ ID NO: 647 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

2 0 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 647 
ACGTCCGTSG TRCGGAAGTA GAACTG 2 6 

25 

2) INFORMATION FOR SEQ ID NO: 648 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 8 

4 0 ACGTCSGTSG TRCGGAAGTA GAACTG 2 6 



2) INFORMATION FOR SEQ ID NO: 649 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 9 

55 

GTCCTATGCC TCARACWCGI GAGCAC 2 6 



60 2) INFORMATION FOR SEQ ID NO: 650 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 650 

TTACGGAACA TYTCAACACC IGT 23 



15 



25 



30 



2) INFORMATION FOR SEQ ID NO: 651 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 651 
TGACGACCAC CITCYTCYTT YTTCA 2 5 

2) INFORMATION FOR SEQ ID NO: 652 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

4 0 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 652 
CCWAYAGTIY KICCICCYTC YCTIATA 2 7 

45 

2) INFORMATION FOR SEQ ID NO: 653 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 653 
60 GAYTTCATIA ARAAYATGAT 2 0 
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2) INFORMATION FOR SEQ ID NO: 654 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 654 

15 

TACAARATYK GIGGTATYGG 2 0 

2 0 2) INFORMATION FOR SEQ ID NO: 655 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 655 

CCRATACCI C MRATYTTGTA 2 0 



35 



45 



50 



2) INFORMATION FOR SEQ ID NO: 656 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
4 0 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 656 
AATTAATGGC TGCAGTTGAY GA 22 

2) INFORMATION FOR SEQ ID NO: 657 



(i) SEQUENCE CHARACTERISTICS: 
5 5 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 657 
TTGTCCACGT TCGATRTCTT CA 22 

5 

2) INFORMATION FOR SEQ ID NO: 658 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 658 
2 0 GATYTAGTCG ATGATGAAGA ATT 23 



25 



55 



2) INFORMATION FOR SEQ ID NO: 659 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 659 

35 

GCTTTTTGIG TTTCWGGTTT RAT 2 3 



40 2) INFORMATION FOR SEQ ID NO: 660 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 660 

GTAGAATTGA GGACGGTAGT TAG 2 3 



2) INFORMATION FOR SEQ ID NO: 661 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
60 (B) TYPE: Nucleic acid 
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10 



25 



(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 661 
GTAGAAYTGT GGWCGATART TRT 2 3 

2) INFORMATION FOR SEQ ID NO: 662 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

2 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium diphtheriae 

(B) STRAIN: ATCC 27010 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 662 



CGGCGCAATC CTCGTTGTTG CTGCCACCGA CGGCCCAATG CCTCAGACCC 50 

GTGAGCACGT TCTGCTCGCT CGCCAGGTCG GCGTTCCTTA CATCCTCGTT 100 

3 0 GCTCTGAACA AGTGCGACAT GGTTGATGAT GAGGAAATCA TCGAGCTCGT 150 

CGAGATGGAG ATCCRTGAGC TGCTCGCTGA GCAGGATTAC GACGAAGAGG 2 00 

CTC CAATCAT CCACATCTCC GCAC TGAAGG CTCTTGAGGG CGACGAGAAG 250 

TGGACCCAGT CCATCATCGA CCTCATGCAG GCTTGCKATG ATTCCATCCC 3 00 

AGACCCAGAG CGTGAGACCG AC AAGCCATT CCTCATGCCT ATCGAGGACA 350 

35 TCTTCACCAT CACCGGCCGC GGTACCGTTG TTACCGGC CG TGTTGAGCGT 4 00 

GGCTCCCTGA AGGTCAACGA GGACGTCGAG AT CATCGGTA TCCGCGAGAA 450 

KGCTACCACC ACCACCGTTA CCGGTATCGA GATGTTCCGT AAGCTTCTCG 500 

ACTACACCGA GGCTGGCGAC AACTGTGGTC TGCTTCTCCG TGGCGTTAAG 550 

CGCGAAGACG TTGAG CGTGG CCAGGTTGTT GTTAAGCCAG GCGCTTACAC 600 

4 0 CCCTCACACC GAGTTCGAGG GCTCTGTCTA CGTTCTGTCC AAGGACGAGG 650 

GTGGCCGCCA CACCCCATTC TTCGACAACT ACCGCCCACA GTTCTACTTC 7 00 

CGCACCACCG ACGTTACCGG TGTTGTGAAG CTTCCTGAGG GCACCGAGAT 750 

GGTCATGCCT GGCGACAACG TCGACATGTC CGTCACCCTG ATCCAGCCTG 800 

TCGCTATGGA TGAGGGCCTG CGCTTCGCTA TC 832 

45 



2) INFORMATION FOR SEQ ID NO: 663 

50 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1192 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida catenulata 
60 (B) STRAIN: ATCC 10565 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 663 

AACGGCGAGC AAGACTTGGT GTTGGAGGTT TCTCAGCACT TGGGTGAGAA 5 0 

5 CACCGTGCGT ACCATTGCCA TGGACGGTAC CGAGGGTTTG GTGAGAGGTA 10 0 

CCGCTGTCAC TGACACCGGT GCTCCCATCT CGGTCCCCGT TGGTCAGGGT 15 0 

ACCTTGGGCC GGATCATCAA CGTTGTCGGT GAGCC CAT CG ACGAGCGTGG 200 

TCCCATCCAG TGCAAGCAGA GAAACCCCAT TCACGCCGAG CCCCCGTCTT 250 

TCACCGAGCA GTCCGTCGAG GCTGAGGTGT TGGAGAC CGG TATCAAGGTT 3 00 

10 GTCGACTTGT TGGCTCCCTA CGCCCGTGGT GGTAAGATTG GTCTTTTCGG 3 50 

TGGTGCCGGT GTCGGTAAGA CCGTGTTCAT CCAGGAGTTG ATTAACAACA 4 00 

TTGCCAAGGC CCACGGTGGT TTCTCCGTGT TCACTGGTGT CGGTGAGCGT 450 

AC TCGTGAGG GTAACGACTT GTACCGTGAA ATGAAGGAGA CCGGTGTCAT 500 

CAACTTGGAG GGCGACTCCA AGGTGGCCTT GGTGTTCGGT CAGATGAACG 550 

15 AGCCCCCGGG GGCTCGTGCC CGTGTCGCCT TGACCGGTTT GACCATTGCC 6 00 

GAGTACTTCC GTGACGAGGA AGGC CAGGAC GTGTTGTTGT TCGTTGACAA 650 

CATTTTCAGA TTCACCCAGG CCGGTTCCGA GGTGTCGGCG TTGTTGGGTC 700 

GTATCCCCTC CGCCGTCGGT TACCAGCCCA CTTTGGC CAC CGACATGGGT 750 

TTGTTGCAGG AGAGAATTAC CACCACCAAG AAGGGTTCCG TCACCTCTGT 8 00 

2 0 GCAGGCCGTG TACGTCCCTG CCGATGACTT GACTGACCCT GCCCCCGCCA 850 

CCACTTTCGC TCACTTGGAC GCCACCACCG TGTTGTCGCG TGGTATCTCC 900 

GAGTTGGGTA TCTACCCCGC CGTCGACCCC TTGGACTCCA AGTCGAGATT 950 

GT TGGACGTC GAGGTTGTTG GCCAGGAGCA CTACGACGTC GCCACCGGTG 1000 

TCCAGGAGTG CTTGCAGGCC TACAAGTCGT TGCAGGACAT CATTGCCATT 1050 

2 5 TTGGGTATGG ACGAGTTGTC CGAGCAGGAC AAGTTGACCG TCGAGAGAGC 1100 

CCGTAAGATC CAGCGTTTCT TGTCGCAGCC CTTCGCTGTC GCCGAGGTTT 1150 

TCACTGGTAT CCCCGGTAGA TTGGTGAGAT TGCAGGACAC CG 1192 



30 



2) INFORMATION FOR SEQ ID NO: 664 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 9 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 664 
AAYATGAT I A CIGGIGCIGC I CARATGGA 2 9 

45 

2) INFORMATION FOR SEQ ID NO: 665 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 1377 bases 

(B) TYPE: Nucleic acid 
.(C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(C) ACCESSION NUMBER: extracted from X00779 

60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 665 

ATGGGTAAAG AGAAGT CTC A CATTAACGTT GTCGTTATCG GTCATGTCGA 50 

TTCTGGTAAG TCTACCACTA CCGGTCATTT GATTTACAAG TGTGGTGGTA 100 

5 TTGACAAGAG AAC CAT CGAA AAGTTCGAAA AGGAAGCCGC TGAATTAGGT 150 

AAGGGTTCTT TCAAGTACGC TTGGGTTTTG GACAAGTTAA AGGCTGAAAG 200 

AGAAAGAGGT ATCACTATCG ATATTGCTTT GTGGAAGTTC GAAACTCCAA 2 50 

AGTACCAAGT TACCGTTATT GATGCTCCAG GTCACAGAGA TTTCATCAAG 300 

AACATGATTA CTGGTACTTC TCAAGCTGAC TGTGCTATCT TGATTATTGC 350 

10 TGGTGGTGTC GGTGAATTCG AAGCCGGTAT CTCTAAGGAT GGTCAAAC C A 4 00 

GAGAACACGC TTTGTTGGCT TTCACCTTGG GTGTTAGACA ATTGATTGTT 4 50 

GCTGTCAACA AGATGGACTC CGTCAAATGG GACGAAT C CA GATTCCAAGA 50 0 

AATTGTCAAG GAAACCTCCA ACTTTATCAA GAAGGTTGGT TACAACCCAA 550 

AGACTGTTCC ATTCGTCCCA ATCTCTGGTT GGAACGGTGA CAACATGATT 600 

15 GAAGCTAC CA CCAACGCTCC ATGGTACAAG GGTTGGGAAA AGGAAACCAA 650 

GGCCGGTGTC GTCAAGGGTA AGACTTTGTT GGAAGCCATT GACGCCATTG 700 

AACAACCATC TAGACCAACT G AC AAGC CAT TGAGATTGCC ATTGCAAGAT 75 0 

GTTTACAAGA TTGGTGGTAT TGGTACTGTG CCAGTCGGTA GAGTTGAAAC 800 

CGGTGTCATC AAGCCAGGTA TGGTTGTTAC TTTTGCCCCA GCTGGTGTTA 8 50 

2 0 CCACTGAAGT CAAGTCCGTT GAAATGCATC ACGAACAATT GGAACAAGGT 900 

GTTCCAGGTG ACAACGTTGG TTTCAACGTC AAGAACGTTT CCGTTAAGGA 95 0 

AATCAGAAGA GGTAACGTCT GTGGTGACGC TAAGAACGAT C C AC CAAAGG 100 0 

GTTGCGCTTC TTTCAACGCT ACCGTCATTG TTTTGAAC CA TCCAGGTCAA 1050 

ATCTCTGCTG GTTACTCTCC AGTTTTGGAT TGTCACACTG CTCACATTGC 110 0 

25 TTGTAGATTC GACGAATTGT TGGAAAAGAA CGACAGAAGA TCTGGTAAGA 1150 

AGTTGGAAGA CCATCCAAAG TTCTTGAAGT CCGGTGACGC TGCTTTGGTC 12 00 

AAGTTCGTTC CATCTAAGCC AATGTGTGTT GAAGCTTTCA GTGAATACCC 1250 

AC CATTAGGT AGATTCGCTG TCAGAGACAT GAGACAAACT GTCGCTGTCG 13 0 0 

GTGTTATCAA GTCTGTTGAC AAGACTGAAA AGGCCGCTAA GGTTACCAAG 1350 

3 0 GCTGCTCAAA AGGCTGCTAA GAAATAA 137 7 



2) INFORMATION FOR SEQ ID NO: 666 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

4 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Saccharomyces cereviaiae 

(C) ACCESSION NUMBER: extracted from M12082 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 666 

50 ATGGTTTTGC CAAGACTATA TACTGCTACA TCCCGTGCTG CTTTTAAAGC 50 

AGC CAAACAA TCCGCTCCGC TTCTATCCAC TTCGTGGAAA AGATGTATGG 100 

CCTCAGCTGC TCAATCTACT CCAATCACCG GTAAAGTTAC CGCTGTCATT 150 

GGTGCCATTG TTGACGTTCA TTTTGAACAA TCAGAGTTGC CCGCTATTTT 2 00 

GAACGCTTTA GAAATTAAAA CACCTCAAGG TAAGTTGGTT TTGGAAGTTG 250 

5 5 CTCAACATTT GGGTGAAAAC ACTGTCAGAA CCATTGCTAT GGATGGTACC 3 00 

GAAGGTTTGG TCCGTGGTGA AAAGGTTCTT GACACTGGTG GCCCTATCTC 350 

CGTCCCAGTT GGGAGAGAAA CTTTAGGGAG AATCATCAAC GTTATCGGTG 4 00 

AACCTATTGA TGAAAGAGGT CCAATTAAGT CCAAACTAAG AAAGCCAATT 4 50 

CACGCAGACC CTCCTAGTTT TGCAGAACAA TCTACTTCGG CTGAAATTTT 500 

60 GGAAACAGGT ATCAAAGTCG TCGATCTATT AGCTCCTTAT GCCAGAGGTG 550 
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GTAAGATTGG TCTTTTCGGT GGTGCAGGTG TCGGTAAGAC TGTGTTCATT 600 

CAAGAATTGA TTAACAATAT CGCCAAGGCC CATGGTGGTT TTTCCGTTTT 650 

CGCCGGTGTT GGTGAAAGGA CCAGAGAGGG TAATGACTTG TACCGTGAAA 700 

TGAAGGAAAC TGGAGT C ATT AACTTGGAAG GTGAATC CAA GGTCGCCTTA 750 

5 GTTTTCGGTC AAATGAACGA ACCTCCAGGA GCCAGAGCCA GAGTCGCTTT 800 

AACTGGTTTG ACGATCGCTG AATATTTCAG AGATGAAGAA GGTCAAGACG 8 50 

TCTTGTTGTT TATCGACAAT ATCTTTAGAT TTACTCAAGC TGGTTCAGAA 9 00 

GTCTCTGCCC TTTTGGGTCG TATTC CATC T GCCGTCGGTT ATCAACCAAC 95 0 

TTTGGCCACT GATATGGGTC TCTTACAAGA AAGAATTACC ACCACAAAGA 100 0 

10 AGGGTTCTGT CACTTCTGTG CAAGCCGTTT ATGTTCCAGC CGATGATTTA 1050 

ACAGATCCGT CTCCGTCCAC ATCTTTTGCC CATTTGGACG CATCATCCGT 110 0 

CTTGTCAAGA GGTATTTCAG AATTAGGTAT TTACCCTGCA GTGGATCCAT 1150 

TGGATTCTAA ATCAAGGTTA TTGGATGCCG CCGTTGTCGG TCAAGAACAT 12 00 

TATGACGTCG CCTCCAAGGT TCAAGAAACT TTACAGACCT ATAAATCTTT 125 0 

15 ACAAGATATC ATTGCTATTT TGGGTATGGA TGAATTGTCC GAACAAGATA 13 00 

AACTAACTGT CGAAAGGGCA AGAAAGATTC AAAGATTCTT ATCTCAACCA 13 50 

TTTGCTGTCG CCGAAGTCTT TACTGGTATC CCAGGTAAAT TAGTGAGATT 14 00 

AAAGGACACC GTTGCCTCGT TCAAAGCCGT TTTGGAAGGT AAATACGATA 14 50 

ATATACCAGA ACATGCTTTC TATATGGTTG GTGGTATTGA AGATGTTGTT 15 0 0 

2 0 CGTAAAGCTG AAAAGTTAGC CCGTGAAGCC AACTAG 153 6 



2) INFORMATION FOR SEQ ID NO: 667 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 93 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: Y 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 667 

4 0 CGGAAGGCGT GCCGCCGGTG TTAACTGCTT TGGATGTGGT GGAAAAACTT 50 

GGC CGTGACG AGCCCTTGAC ACTTGAAATT GTACAGCACT TGGACGCCCA 100 

CACCGGCCGT TGCATTGCGA TGCAGACGAC AGATCTGCTT AAGCTGAAAT 150 

CGAAGGTAGT CTCGAGTGGC GGGAACATTT CTGTGCCTGT GGGTCGGGAG 200 

ACACTTGGCA GAAT CTTCAA TGTGC TGGGC GATGCCATTG AC CAGCGC GG 250 

45 CCATGTTGGA GAGAAGCAAC GCATGCCTAT TCACGCTGTG GCACCAAAGT 3 00 

TGGCGGATCA GGCGGCAGAG GACACCATTC TCACAACGGG TATCAAGGTG 3 50 

ATTGAT CTGA TTTTGCCCTA CTGCAAGGGT GGGAAGATTG GTCTTTTTGG 400 

TGGTGCTGGC GTGGGCAAGA CGGTTATTAT CATGGAACTT ATTAACAACG 450 

TTGCCAAGGG CCATGGTGGG TTCTCCGTCT TTGCGGGTGT TGGTGAACGC 50 0 

5 0 ACGCGTGAGG GGACGGATCT TTACCTGGAG ATGATG CAAT CCAAAGTTAT 55 0 

TGAC CTGAAG GGTGACTCGA AGTGTGTGTT GGTGTATGGT CAGATGAACG 600 

AGCCTCCGGG TGCCCGTGCT CGTGTGGCGC AGTCTGCCTT GACAATGGCC 65 0 

GAGTACTTCC GTGATGTGGA AGGGCAAGAC GTGTTGCTTT TTATCGACAA 7 00 

CATTTTCCGC TTTACGCAGG CAAACTCTGA GGTGTCAGCG CTGTTGGGTC 750 

55 GTATTCCCGC TGCCGTCGGC TAC CAGCCTA CCCTTGCTGA GGATCTTGGG 800 

CAGTTGCAGG AGCGCATTAC GTC CACGACA AAAGGTTCCA TTACCTCTGT 850 

GCAGGCTGTG TACGTGCCAG CCGATGATAT TACCGACCCT GCGCCAGCAA 900 

CGACCTTTTC CCACCTCGAT GCCACGACGG TGCTGGACCG TGCCGTTGCC 950 

GAATCCGGCA TTTACCCCGC TGTCAACCCA CTGGAGTGTG CCTCGCGTAT 10 00 

60 CATGGACCCG GATGTAATCA GCGTTGACCA CTACAACGTG GCGCAGGATG 1050 
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TGGTGCAGAT GCTTACCAAG TACAAGGAGC 

CTTGGCATTG ATGAGCTCAG TGAGGAGGAT 

GCGTAAGGTG ACAAAGTTTC TCTCCCAGCC 

TTACTGGCAT GACAGGC CAC TACGTGCAGC 

5 TTTTCCGGCC TGTTGATGGG CACATATGAT 



TGCAGGATAT CATTGCGGTG 110 0 

AAACTTATCG TGGACCGTGC 1150 

TTTCCAGGTG GCGGAGGTGT 12 00 

TGGAGGAGAC CATTGAGTCT 12 50 

CAGGTGCCGG AGA 12 93 



2) INFORMATION FOR SEQ ID NO: 668 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1191 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: CoryneJbacteriu/n glutamicum 

(C) ACCESSION NUMBER: X77034 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 668 

25 GTGGCAAAGG CGAAGTT CGA GCGTACCAAG CCCCACGTAA ACATCGGCAC 50 

CATCGGTCAC GTTGACCACG GTAAGACCAC CACCACCGCG GCTATCACCA 100 

AGGTTCTGGC TGACACTTAC CCTGAGCTCA ACGAGGCTTT CGCCTTCGAC 150 

TCCATCGATA AGGCTCCTGA GGAGAAGGAG CGTGG CATC A CGATCAACAT 200 

CTCCCACGTT GAGTACCAGA CTGAAAAGCG CCACTACGCA CACGTTGACG 250 

3 0 CTCCAGGCCA CGCCGACTAC ATCAAGAACA TGATTACCGG CGCTGCTCAG 300 

ATGGACGGCG CAATCCTCGT TGTTGCTGCT ACCGACGGCC C AATGCCT CA 350 

GACCCGTGAG CACGTTCTTC TTGCTCGC CA GGTTGGCGTT CCTTACATCC 400 

TCGTTGCTCT TAACAAGTGC GACATGGTTG AGGATGAGGA AATCATCGAG 450 

CTCGTCGAGA TGGAAGTTCG TGAACTTCTT GCTGAGCAGG AC TACGACGA 500 

3 5 AGAGGCTCCA ATTGTTCACA TCTCCGCTCT GAAGGCTCTT GAGGGCGACG 550 

AGAAGTGGGG CAAGCAGATC CTTGAGCTCA TGCAGGCTTG CGATGACAAC 600 

ATCCCTGACC CAGT TCGTGA GACCGACAAG CCATTCCTCA TGCCTATCGA 650 

GGACATCTTC ACCATCACCG GTCGTGGCAC CGTTGTTACC GGTCGTGTTG 700 

AGCGCGGTAC CCTGAACGTG AACGATGATG TTGACATCAT CGGCATCAAG 750 

4 0 GAGAAGTCCA CCTCCACCAC CGTTACCGGT AT CGAGATGT TCCGTAAGCT 800 

TCTTGACTCC GCTGAGGCTG GCGACAACTG TGGTCTGCTT CTCCGTGGTA 850 

TCAAGCGCGA AGATGTTGAG CGTGGCCAGG TTATCGTTAA GCCAGGCGCT 900 

TACACCCCTC ACACCGAGTT CGAGGGCTCT GTCTACGTTC TGTCCAAGGA 950 

TGAAGGTGGC CGCCACACCC CATTCTTCGA CAACTACCGT CCTCAGTTCT 1000 

45 ACTTCCGCAC CACCGACGTT ACCGGTGTTG TGAAGCTTCC AGAGGGCACC 105 0 

GAGATGGTCA TGCCTGGCGA CAACGTCGAC ATGTCCGTCA CCCTGATCCA 1100 

GCCTGTCGCT ATGGACGAGG GCCTGCGTTT CGCTATCCGC GAAGGCTCCC 1150 

GCACCGTTGG CGCTGGTCGT GTCACCAAGA TCATCAAGTA A 1191 



50 



2) INFORMATION FOR SEQ ID NO: 669 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 13 83 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: extracted from V00267 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 669 

ATGGCTACTG GAAAGATTGT CCAGGTAATC GGCGCCGTAG TTGACGTCGA 50 

ATTCCCTCAG GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTG C AAA 100 

10 ATGGTAATGA GCGTCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGCGGT 150 

ATCGTACGTA CCATCGCAAT GGGTTCCTCC GACGGTCTGC GTCGCGGTCT 200 

GGATGTAAAA GACCTCGAAC ACCCGATTGA AGTCCCGGTA GGTAAAGCGA 2 50 

CTCTGGGCCG TATCATGAAC GTACTGGGTG AACCGGTCGA CATGAAAGGC 3 00 

GAGATCGGTG AAGAAGAGCG TTGGGCGATT CACCGCGCAG CACCTTCCTA 350 

15 CGAAGAGCTG TCAAACTCTC AGGAACTGCT GGAAACCGGT ATCAAAGTTA 400 

TCGACCTGAT GTGTCCGTTC GCTAAGGGCG GTAAAGTTGG TCTGTTCGGT 450 

GGTGCGGGTG TAGGTAAAAC CGTAAACATG ATGGAGCTCA TTCGTAACAT 500 

CGCGATCGAG CACTCCGGTT AC.TCTGTGTT TGCGGGCGTA GGTGAACGTA 550 

CTCGTGAGGG TAACGAC TTC TACCACGAAA TGACCGACTC CAACGTTATC 600 

2 0 GACAAAGTAT CCCTGGTGTA TGGCCAGATG AACGAGCCGC CGGGAAACCG 650 

TCTGCGCGTT GCTCTGACCG GT CTGAC CAT GGCTGAGAAA TTCCGTGACG 70 0 

AAGGT CGTGA CGTTCTGCTG TTCGTTGACA ACATCTATCG TTACACCCTG 750 

GCCGGTACGG AAGTATCCGC ACTGCTGGGC CGTATGCCTT CAGCGGTAGG 8 00 

TTATCAGCCG ACCCTGGCGG AAGAGATGGG CGTTCTGCAG GAACGTAT CA 850 

25 CCTCCACCAA AACTGGTTCT ATCACCTCCG TACAGGCAGT ATACGTACCT 900 

GCGGATGACT TGACTGACCC GTCTCCGGCA ACCACCTTTG CGCACCTTGA 950 

CGCAACCGTG GTACTGAGCC GTCAGATCGC GTCTCTGGGT ATCTACCCGG 1000 

CCGTTGACCC GCTGGACTCC ACCAGCCGTC AGCTGGACCC GCTGGTGGTT 1050 

GGTCAGGAAC ACTACGACAC CGCGCGTGGC GTTCAGTCCA TCCTGCAACG 1100 

3 0 TTATCAGGAA CTGAAAGACA TCATCGCCAT CCTGGGTATG GATGAACTGT 1150 

CTGAAGAAGA CAAACTGGTG GTAGCGCGTG CTCGTAAGAT CCAGCGCTTC 12 00 

CTGTCCCAGC CGTTCTTCGT GGCAGAAGTA TTCACCGGTT CTCCGGGTAA 1250 

ATACGTCTCC CTGAAAGACA CCATCCGTGG CTTTAAAGGC ATCATGGAAG 13 00 

GCGAATACGA TCACCTGCCG GAGCAGGCGT TCTACATGGT CGGTTCCATC 1350 

3 5 GAAGAAGCTG TGGAAAAAGC CAAAAAACTT TAA 13 83 



40 
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2) INFORMATION FOR SEQ ID NO: 670 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1410 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Helicobacter pylori 

(B) STRAIN: NCTC 11638 

(C) ACCESSION NUMBER: extracted from AF004014 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67 0 



ATGAAAGCGA TGGAAGGTAA AATCATTCAG GTTTTAGGCC CGGTGGTAGA 50 

TGTGGAGTTT GAATC CTATC TGCCGGCGAT TTTTGAAGCA CTAGACATTA 100 

ATTTTGAAGT TAATGGCGTT CAAAAATCTT TAGTTTTAGA GGTGGCAGCC 150 

CATTTGGGTG GTAATCGGGT GCGAGCGATT GCTATGGATA TGACAGAAGG 2 00 

6 0 CTTAGTGCGT AACCAAGCCG TCAAAGCTCG TGGCAAAATG ATTGAAGTGC 250 
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CTGTGGGCGA AGAAGTGTTA GGGCGTATTT TTAATGTTGT GGGCGAGAGC 3 00 

ATTGATAATT TAGAGCCGCT TAAGCCGTCC TTAACTTGGC CCATTCACAG 350 

AAAAGCCCCT AGTTTTGAGC AGCAAAGCAC TAAAACAGAA ATGTTTGAAA 40 0 

CCGGTATTAA AGTCATTGAC TTGCTCGCGC CTTATTCTAA GGGCGGTAAA 450 

5 GTAGGCTTGT TTGGTGGGGC TGGCGTAGGC AAAACGGTGA TCATTATGGA 50 0 

GCTTATCCAC AATGTGGCTT ATAAGCATAA CGGGTATTCG GTGTTTGCAG 550 

GTGTGGGGGA GCGCACCAGA GAAGGGAACG ATCTGTATTT TGAGATGAAA 60 0 

GAAGGGGGCG TTTTAGACAA AGTTGCGTTG TGCTATGGGC AAATGAATGA 65 0 

GCCACCAGGT GCAAGGAATC GCATCGCATT CACCGGCTTG ACGATGGCGG 70 0 

10 AGTATTTCCG TGATGAAAAG GGCTTAGATG TGTTGATGTT TATTGATAAC 75 0 

ATCTTTAGAT ACGCTCAAAG CGGTGCGGAA ATGAGCGCGC TATTAGGCCG 80 0 

TATCCCTTCA GCGGTGGGGT ATCAGCCCAC GCTAGCCGGG GAAATGGGGA 85 0 

AACTTCAAGA GCGTATCGCT TCCACTAAAA ATGGCTCTAT CACTTCGGTT 90 0 

CAAGCGGTGT ATGTGCCAGC AGACGACTTG ACTGACCCAG CCCCTGCTTC 950 

15 GGTGTTTGCG CATTTAGATG CGACTACGGT GTTGAATAGA AAGATCGCTG 100 0 

AAAAAGGGAT TTATCCGGCG GTGGATCCTT TGGATTCCAC TTCAAGGATT 1050 

TTAAGCCCTC AAATGATTGG CGAGAAGCAC TATGAAATCG C C AC CGGT AT 1100 

CCAGCAAGTT TTGCAAAAAT ACAAGGATTT GCAAGATATT ATTGCGATTT 1150 

TGGGATTGGA CGAATTGAGC GAAGAGGATA AAAAAACGGT TGAAAGGGCC 12 0 0 

2 0 AGAAAAATTG AGAAGTTTTT ATCCCAGCCG TTTTTTGTGG CTGAAGTGTT 1250 

TACAGGAAGT CCCGGTAAGT ATGTGACTCT C CAAGAGACT TTAGAGGGCT 130 0 

TTGGAGGGAT TTTAGAGGGC AAATACGATC ACATTCCTGA AAACGCGTTT 1350 

TACATGGTGG GCAGCATTCA AGAGGTTTTA GAAAAAGCTA AAAAC AT GAA 1400 

AAATTCCTAA 1410 

25 



2 ) INFORMATION FOR SEQ ID NO: 671 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 01 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium acetobutylictun 
4 0 (B) STRAIN: DSM 7 92 

(C) ACCESSION NUMBER: extracted from AF101055 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 671 

45 ATGCCAGAAC ATGTAGGTAA AATTGTT CAG GTAATAGGAC CTGTTGTGGA 5 0 

TATTAAATTT GATG CAGAGA ACCTTCCTGA CATCTATAAT TCCATAGAAA 100 

TAGATATGGG AGATAATAAA AAAC TCATTG CTGAAGTTGA ACAACATGTA 150 

GGAGATGACA TAGTAAGAAC AATAGCAATG GAAGGTACTG ACGGATTAAA 200 

AAGAGGAATG GAAGCAGTTA ACACTGGTAA ACCAATATCT GTAC CAGTTG 250 

50 GAGAAAATGT TTTAGGACGT CTTTTTAATG TTTTAGGTCA GACAATAGAT 300 

GAAGCAGGAG ACATGAATGC TGATAAGTAT TATCCAATTC ATAGACCAGC 350 

TCCAACCTTT GAAGAACAAT CAGTTCAACC AGAAATGTTT GAAACAGGTA 40 0 

TTAAGGTTAT AGATTTACTT GCTCCATATC AAAAGGGTGG AAAAATCGGT 450 

TTGTTCGGTG GTGCCGGTGT TGGTAAAACA GTTCTTATTC AGGAACTTAT 500 

55 AAATAATATA GCAAAAGAAC ACGGTGGATT ATCAGTATTC ACAGGTGTTG 55 0 

GAGAAAGAAC AAGAGAAGGT AATGACCTTT ATTATGAAAT GAAAGATT CA 60 0 

GGAGTTATAA AT AAAAC AG C TCTAGTATTT GGTCAGATGA ATGAACCACC 650 

TGGCGCTAGA ATGAGAGTTG CTTTAACAGG ACTTACAATG GCTGAATATT 700 

TTAGAGACAA AGGTCAAGAT GTGCTTCTAT TTATAGATAA TATATTCAGA 750 

60 TTTACACAAG CTGGTTCAGA GGTTTCAGCT TTACTTGGTA GAATACCTAG 80 0 
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TGCCGTTGGT TATCAGCCAA CACTTGCAAA TGAAATGGGT GCTCTTCAAG 850 

AGAGAATAAC ATCAACAAAA CAGGGTTCAA TCACATCCGT TCAGGCTGTA 90 0 

TATGTTCCTG CTGATGACCT TACAGACCCA GCTCCAGCAA CAACATTTAC 950 

GCATCTTGAT G CAACAAC AG TTCTTTCAAG AGAAATATCA AACTTAGGAA 1000 

5 TATATCCTGC AGTTAGTCCT CTTGAATCAA CTTCAAGAAT ACTTGATCCA 1050 

AGAATTGTTG GAGAAGAGCA TTATGAAGTT GCTAACAAGG TTAAACATAT 1100 

ACTTGAAAGA TATCAAGAAC TTCAAGATAT CATAGCTATA CTTGGTGTTG 1150 

ATGAACTTTC AGATGAGGAT AGATTGTTAG TTGGAAGAGC AAGAAGAGTA 12 0 0 

CAGAGATTCT TATCTCAAGC TTTTAGTGTT GCTGAACAAT TTACAGGAAT 1250 

10 GAAAGGT GAG TTTGTACCTG TAAAAGATAC TATAAGAAGT TTTAAAGAAA 13 00 

TATTAGATGG TAAGTGTGAT GATCTTCCAG AAGCTGCATT TTTATTTGCA 1350 

GGAACAATAG AAGATGTAAA AGAAAAAGCT AAAAAAATGA TGGAAAGCTA 14 00 

A 1401 

15 

2) INFORMATION FOR SEQ ID NO: 672 

(i) SEQUENCE CHARACTERISTICS: 
2 0 (A) LENGTH: 1509 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cytophaga lytica 

(B) STRAIN: DSM 2039 

30 (C) ACCESSION NUMBER: M22535 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:" 672 

ATGTCTAAAG TTACAGGTAA AGTTTCCCAA ATTATTGGCC CAGTTATAGA 50 

35 TGTGGAGTTC CAAGCAGGGG TAGATCTTCC AAAAATTTAT GATTCATTAG 100 

AAATTAAAAA AGCAGATGGA TCAATTTTGG TTTTGGAAGT ACAATCACAC 15 0 

ATTGGTGAGA ACACAGTAAG AACTATATCT ATGGATTCTT CTGATGGTTT 200 

AAGTCGTGGA GCAGAGGTTA ATG CAACAGG AAGCGCTATA CAAATGC CAG 250 

TTGGAGATGA CGTTTACGGA CGTTTATTTA ACGTAATTGG AGACGCTATT 3 00 

40 GATGGTCTTG GGAATTTACC AAAATCTGGT AAAGATGGTT TGCCAATACA 3 50 

C AGAGAGG CA CCAAAATTTG AAGACTTATC TACTTCTACA GAAGTATTAT 4 00 

TTACAGGTAT TAAAGTAATT GACCTTATTG AGCCTTATGC AAAAGGTGGT 450 

AAGATTGGTT TATTTGGAGG TGCCGGAGTA GGTAAAACAG TATTAATT C A 500 

GGAATTAATT AACAACATTG CAAAAGGTCA CGGTGGACTT TCTGTATTTG 550 

45 CTGGTGTAGG TGAGCGTACT CGTGAGGGTA ACGATTTACT ACGTGAGATG 600 

TTAGAGTCTG GTATTATTAA ATACGGAGAT GACTTTATGC ACTCTATGGA 650 

AGAAGGTGGT TGGGATTTAT CTAAAGTTGA TAAATCTGTA ATGAAAGATT 700 

CTAAAGCAAC CTTTGTATTT GGACAAATGA ATGAGCCACC AGGAGCACGT 75 0 

GCACGTGTTG CATTATCTGG TTTAACTATT GCAGAATATT TCCGTGATGG 800 

50 AGCAGGTGAA GGTCAAGGTA AAGATGTACT TTTCTTTGTG GATAACATTT 8 50 

TCCGTTTTAC ACAAGCTGGT TCTGAGGTAT CTGCATTACT TGGTCGTATG 90 0 

CCATCTGCGG TAGGTTACCA ACCAACATTA GCAACAGAAA TGGGTGCTAT 950 

GCAAGAGCGT ATTACATCAA CAAAAAGAGG TTCTATTACA TCTGTACAGG 100 0 

CGGTTTACGT ACCTGCGGAT GATTTAACGG ATCCAGCACC GGCAACTACC 105 0 

55 TTTGCTCACT TAGATGCAAC AACGGTATTG TCTCGTAAAA TTGCAGAGTT 110 0 

AGGTATTTAC CCAGCGGTAG ATCCATTAGA TTCTACTTCT AGAATCTTAG 1150 

CTCCAGAAAT TTTAGGAAAA GATCACTACT CTTGTGCACA ACGTGTAAAA 12 0 0 
GAGTTGTTAC AACGTTATAA AGAATTACAA GATATTATTG CTATCCTTGG . 12 50 

TATGGAAGAA TTATCTGAGG AAGATAAAAT GGCAGTTGGT AGAGCAAGAC 13 00 

60 GTGTACAACG TTTCTTATCT CAGCCTTTCC ACGTAGCAGA GCAAT TTACA 13 5 0 
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15 



45 



GGTC TTAAAG GTGTTTTAGT AGATATCAAG GATACTATTA AAGGATTTAA 14 00 

TATGATTATG GATGGTGAAT TAGATCACTT ACCAGAATCT GCATTTAACC 14 50 

TTAAAGGTAC TATTGAAGAA GCTATAGAAG CAGGAGAAAA AATGCTTGCT 150 0 

GAAGCATAA 1509 



2) INFORMATION FOR SEQ ID NO: 673 



10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ehrlichia rlsticll 
20 (B) STRAIN: HRC-IL 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 673 

CCTAAAATAT ATGATGCATT ATATGTAAAA CTAGATAATG AAAATTTGTG 5 0 

25 TTTAGAAGTT TCACAAATTA TTGGAGATAA TGTTGTTAGA TGTATTG CAA 100 

TGGGAGCTAC TTATGGATTA AATCGTGGTT TAGAAGTAGT TTGTTCAGGA 150 

AATC CAATTC AGGTTCCTGT AGGTGAACAA GTTTTAGGTA GAATGTTTAA 2 00 

TGTTGTTGGT AAAACAATTG ACAATCTTGA ATCTTTAGAT GATAAAAATA 25 0 

TAAAAATGAT GCCAATTCAT CGAAAT C CAC CATCATTTGA AGAGCAATCC 3 00 

30 AATGAAATTG AAATTTTTGA AACAGGCATT AAAGTTATTG ATTTATTAAT 3 50 

TCCATATGCT AAAGGTGGTA AGATTGGATT ATTTGGTGGA GCAGGGGTTG 4 00 

GGAAAACGGT TCTTGTTCAA GAATTAATTC ACAATATCGC AAAAGGT CAT 4 50 

GGTGGTCTAT CTGTTTTTGC TGGAGTTGGT GAAAGAACTC GTGAAGGTAA 5 00 

TGACTTGTAT TATGAAATGA TTGAAGGTGG AGTTATAGAT AAAACAGCCT 550 

35 TAGTGTTTGG GCAAATGAAT GAACCTCCTG GCGCAAGAAT GCGCGTAGCA 600 

TTAACTGCTT TAACAATGGC TGAATATTTC CGTGATGTTC AAAAC C AAGA 6 50 

TGTTTTGTTA TTTATTGATA ATATCTTTAG ATTTACACAA GCTGGTAGTG 7 00 

AAGTTT C AG C ATTATTAGGA AGAATGC CAT CTGCTGTTGG TTATCAAC CA 7 50 

ACTTTGGCAT ATGAAATGGG ATTGTTACAA GAAAGAATCA CTTCCACTAA 800 

4 0 AAGTGGTTCT ATAACATCT 819 



2) INFORMATION FOR SEQ ID NO: 674 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 84 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Vibrio cholerae 

(B) STRAIN: ATCC 25870 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 674 

60 AGAGCGAAGT ACCAAGTGTT TACGATGCTC TGAATGTTGT GGATTCCAAA 50 
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GAACGTCTGG TTCTGGAAGT TCAACAGCAG 
CGCTATCGTT ATGGGTAGCT CGGATGGTTT 
AAAACACTGG CGCTCCAATT TCAGTACCAG 
CGTATCATGA ACGTGCTTGG TGATGCGATT 
5 CGCAGAAGAG GTGTACTCGA TTCACCGTCC 
AGTCTAGTGC AACTGAACTT TTGGAAACGG 
ATCTGTCCGT TTGCGAAAGG CGGTAAAATC 
TGTAGGTAAG AC CGTTAACA TGATGGAACT 
AGCACTCAGG TTTGTCAGTA TTTGCTGGGG 

10 GGTAACGACT TCTACCACGA AATGCAGGAA 
ACAACCAGAA CTGTCGAAAG TAGCGATGGT 
CACCAGGCAA CCGTCTGCGT GTAGCACTGA 
AAGTTCCGTG ATGAAGGCCG TGACGTACTG 
CCGTTACACC CTAGCGGGAA CGGAAGTATC 

15 CTTCAGCGGT AGGTTACCAA CCAACACTGG 
CAAGAACGTA TCACGTCAAC CAAAAAAGGT 



CTAGGCGGTG GCGTGATTCG 10 0 

ACGTCGTGGA ATGACAGTAC 15 0 

TAGGTACTAA AACCCTAGGT 200 

GACGAACGTG GCGACATTGG 250 

TGCTCCAAGC TACGAAGAAC 3 00 

GTGTTAAGGT TATCGACCTG 3 50 

GGTCTGTTCG GTGGTGCGGG 40 0 

TATCAACAAC ATCGCGCTAC 450 

TAGGTGAGCG TACTCGTGAG 500 

GCGGGCGTTG TAAACGTTGA 550 

TTACGGTCAG ATGAACGAGC 600 

CTGGTCTGAC TATGGCGGAA 650 

CTGTTTATCG ACAACATCTA 700 

TGCTCTGCTT GGCCGTATGC 750 

CTGAAGAGAT GGGTGTTCTG 800 

TCTATCACCT 84 0 



20 2) INFORMATION FOR SEQ ID NO: 675 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Vibrio cholerae 

(B) STRAIN: ATCC 25870 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 675 



CGGCGGTATC CTAGTTGTAG CGGCAACTGA CGGTCCAATG CCACAAACTC 5 0 

GTGAGCACAT CCTGCTGGGT CGCCAAGTAG GTATTCCTTA CATCATCGTA 100 

TTCATGAACA AGTGTGACAT GGTTGACGAT GAAGAGCTTC TAGAGCTGGT 150 

AGAGATGGAA GTTCGTGAGC TGCTGTCTGA GTACGATTTC C CAGGTGATG 2 00 

4 0 AC CTGCCAGT AATCCAAGGT TCAGCACTAG GCGCGCTAAA CGGCGAAGCA 2 50 

CAGTGGGAAG CGAAGAT TGT TGAGCTAGCA GAAGCACTGG ATACTTAYAT 3 00 

TCCAGAGCCA GAGCGTGCAG TAGACATGGC ATTCCTGATG CCAATCGAAG 3 50 

ACGTATTCTC AATCCAAGGT CGTGGTACAG TAGTAACTGG CCGTATCGAG 4 00 

CGCGGCATCC TGAAAGTGGG TGACGAAGTA GCGATCGTTG GTATCAAAGA 4 50 

45 GACAGTAAAA ACGACCTGTA CAGGTGTAGA GATGTTCCGT AAGCTGCTTG 500 

ACGAAGGTCG TGCAGGTGAG AACGTAGGTG CACTRCTACG TGGTACTAAG 550 

CGTGAAGAAG TAGAGCGTGG TCAAGTACTG GCGAAGCCAG GTTCAATCAC 6 00 

ACCACACACT AAGTTCGAAT CAGAAGTATA CGTACTGTCA AAAGATGAAG 6 50 

GTGGCCGTCA TACTCCATTC TTCAAAGGTT ACCGTCCACA GTTCTACTTC 7 00 

50 CGTACAACTG ACGTAACAGG CAGCATTGAG CTACCAGARG GCGTAGAAAT 750 

GGTAATGCCA GGCGACAACG TGAAGATGGT TGTAGACCTG ATTGCACCAA 8 00 

TCGCGATGGA CGAAGGTCTA CGCTTCGC 82 8 



55 

2) INFORMATION FOR SEQ ID NO: 676 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 98 bases 
60 (B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Leishmania enriettxi 

(B) STRAIN: ATCC 50120 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 676 

CTTCTCGGAT GGCGTGCCGC CGGTGCTGAC GGCGCTGGAC GTGACGGAGG 50 

SctSggcg CGACGAGCCG CTGACGCTAG AGATCGTGCA ACACTTGGAC 100 

GCGCACACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TACTGAAGCT 150 

is gSgtcgaag GTCGTGTCGA CCGGCGGGAA CATCTCTGTG CCTGTGGGCC 200 

GCGAGACGCT TGGCCGCATC TTCAACGTAC TGGGCGACGC GATTGACCAG 250 

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CGGAGGCGCC 300 

§2ScTCGCG GAC CAGGCGG CGGAGGACAC GATCCTGACG ACCGGCATCA 350 

Sa^tS?CGA TCTGATCCTG CCGTACTGCA AGGGCGGCAA GATCGGTCTG 400 

T^SScGG?G CCGGTGTGGG GAAGACTGTG ATCATCATGG AGCTGATCAA 450 

CAACGTCGCG AAGGGCCACG GTGGCTTCTC CGTGTTCGCC GGCGTTGGGG 500 

AGCGCACCCG CGAGGGGACG GATCTGTACC TGGAGATGAT GCAATCGAAG 550 

GTGATCGACC TGAAAGGTGA GTCGAAGTGT GTGCTTGTGT ACGGGCAGAT 600 

GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

TGGCCGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 700 

GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CAGCGCTGCT 750 

GgScCGCATC CCCGCCGCTG TGGGCTACCA ACCGACGCTT GCGGAGGATC 800 

TCGGCATGTT GCAGGAGCGC ATCACGTCGA CGACGAAGGG GTCGATCACG 850 

TCTGTGCAGG CGGTGTACGT GCCTGCGGAT GATATCACGG ATCCCGCGCC 900 

, n QGCGACGACA TTCTCGCACC TGGACGCGAC GACGGTGCTG GACCGCGCGG 950 

?gS2Sgag?c TGGCATCTAC CCCGCGGTCA ACCCACTGGA GTGCGCGTCG 1000 

CGTATCATGG ACCCTGATGT GATCGACGTG GATCACTACA ACGTTGCGCA 1050 

GGACATTGTT CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 1100 

rGOTGCTTGG TATCGACGAG CTGAGCGAGG AGGACAAGGT TGTGGTGGAC 1150 

CGTGCGCGCA AGGTGACCCG GTTCCTGTCA CAGCCGTTCC AGGTTGCGGA 12 0 0 

GGTGTTTACT GGCATGACAG GCCATTACGT ACAGCTGAGC GACACGGTGG 125 0 

AGTCGTTCTC CGGTTTGCTG ATGGGGTCGT ACGACCAGAT TCCGGAGA 1298 



20 



25 



35 



40 

2) INFORMATION FOR SEQ ID NO: 677 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1083 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia microtti 

(B) STRAIN: Persing-1 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 677 

CAAGCTCAAG TCTGAGCGTG AGAGAGGTAT TACTATTGAC ATTACTCTCT 50 

GGAAATTTGA GACCCAGAAA TACGAGTACA CTGTCATAGA CGCACCTGGT 100 

CATCGTGACT TTATCAAAAA TATGATTACT GGGACTT C AC AAGCCGACGT 150 

60 TGCTATGCTT GTCGTTCCTG CTGAATCTGG CGGATTCGAG GCTGCTTTTT 200 
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CCAAAGAAGG TCAGACCCGT GAACACGCCT TACTAGCCTT CACACTTGGC 250 

GTCAAACAGA TGATTGTTGC TATTAACAAA ATGGATTCTT GTCAGTACAA 3 00 

GGAGGATCGT TATATGGAAA TTTTCAAGGA AGTACAGCAG TACTTGAAGA 350 

AGGTGGGTTA CAAAGTTGAA AGCGTGCCGT TTGTTGCTAT TTCAGGATTC 4 00 

5 CACGGTGACA ACATGGTTGA AAAATCTACT AACATGCCTT GGTATAAGGG 4 50 

TAAGACCCTC GTAGAGGCAC TTGAT CAAAT GGAGCCTCCA AAACGTCCGG 50 0 

TCGAAAAACC TCTTAGATTG CCCCTGCAGT CAGTCTATAA AATTGGAGGT 550 

ATTGGTACGG TACCAGTCGG AAGGGTCGAA ACAGGACAAC TGAAAGCAGG 60 0 

AATGATCATT ACTTTTGCCC CCACTGGTTT GACCACTGAA TGTAAATCTG 650 

10 TTGAAATGCA TCACGAGGTT GTGGAAGTGG CTAGCCCCGG TGATAACGTT 70 0 

GGATTTAATG TCAAGAATGT GTCTGTTAAG GATATTAAGA GAGGAAATGT 75 0 

GGCTTCGGAT TCGAAAAATG ACCCAGCCAA GGAAGCTACC TCTTTCTCTG 800 

CACAAGTCAT TGTACTCAAT CACCCTGGTA CCATCAAGGC CGGTTACTCA 85 0 

CCTGTGGTTG ATTGCCATAC TGCCCACATT GCTTGCAAAT TCGAATCTCT 90 0 

15 AGACACTAGG ATTGACAAGC GTACTGGCAA GACTTTGGAA GAAAATCCTA 95 0 

AGACTATTAA GAATGGTGAC GCTGCCATGG TGACTATGAA ACCAAATAAA 1000 

CCCATGGTTG TGGAAACTTT CACCGACTAC GCCCCGTTGG GCCGGTTCGC 105 0 

CGTGCGTGAT ATGCGCCAAA CCGTTGCCGT CGG 1083 



20 



2) INFORMATION FOR SEQ ID NO: 678 



(i) SEQUENCE CHARACTERISTICS: 
2 5 (A) LENGTH: 551 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus neoformans 

(B) STRAIN: Lev- 12 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 678 



35 



TGTGCTCATT CAGGAGCTCA TTAACAACAT TGCCAAGGCT CACGGTGGTT 50 

ACTCTGTCTT CACTGGTGTC GGTGAGCGAA CTCGTGAGGG TAACGACTTG 100 

4 0 TACCACGAAA TGAGGGAGAC TGGTGTTATC AACCTTGAGG GTGACTCCAA 150 

GGTCGCTCTT GGTGAGTTCT TTTTTTTCTT CAGGCTAATT AGTCGATGAC 2 00 

GTGGGCCCTG ACTAAAACTG TTTCTTCCAG TCTTCGGTCA GATGAACGAG 2 50 

CCCCCTGGAG CCCGTGCCCG AGTTGCCCTT ACTGGTTTGA CCATTGCCGA 3 00 

GTACTTCCGT GACGAGGAAG GCCAGGATGT GTTGCTTTTC ATTGACAACA 3 50 

45 TTTTCCGATT CACCCAGGCC GGTTCCGAGG TGTCTGCCTT GCTCGGTCGT 4 00 

ATCCCCTCTG CCGTCGGTTA CCAGCCCACT CTTTCCACCG ACATGGGTGG 4 50 

TATGCAGGAG CGAATGTAGG TTGCATTCTC TGTGATTTTA CGGCAAGCCT 500 

TGACTTTTTT TTTCTAGTAC CACCACCAAG AAGGGTTCCA TTACCTCCGT 550 

C 551 

50 

2) INFORMATION FOR SEQ ID NO: 679 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 552 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus neoformans 

(B) STRAIN: ATCC 44104 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 679 

TCTGCTCATT CAGGAGCTCA TTAACAACAT TGCCAAGGCC CACGGTGGTT 50 

m SSSSctt CACCGGTGTC GGTGAGCGAA CTCGTGAGGG TAACGACTTG 100 

TaSSaXI TGAGGGAGAC TGGTGTTATC AACCTTGAGG GTGACTCCAA 150 

GGTCGCTCTT GGTGAGTTCT TTTTTTTGGG AGCTAATTAG TCGATGACGT 200 

GgSStGAC CAAAACCGTT TCTTTCAGTC TTCGGTCAGA TGAACGAGCC 250 

CCCT^SScC CGTGCCCGAG TTGCTCTTAC TGGTTTGACC ATTGCCGAGT 3 00 

IS AOTCOGTOA CGAGGAAGGC CAGGATGTGT TGCTTTTCAT TGACAACATT 350 

t5Sga??2a CCCAGGCCGG TTCCGAGGTG TCTGCCTTGC TCGGTCGTAT 400 

ScCTCTOCC GTCGGTTACC AGCCCACTCT TTCCACCGAT ATGGGTGGTA 450 

TGCAAGAGCG AATGTAAGTT GCATTTTTTG TGATTTTACG GCAAGTCTTG 500 

IcraCATTT TTTTCTAGCA CCACCACCAA GAAGGGTTCC ATTACCTCCG 550 
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2) INFORMATION FOR SEQ ID NO: 680 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1018 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

30 (D) TOPOLOGY: Linear ' 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Cunninghamella bertholletiae 

(B) STRAIN: ATCC 42115 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 680 

TCTCCCTGCT ATTTTAAACG CTCTTGAAGT TAAGGATCAT GCTGGTGGTC 50 

GTCTTGTTCT TGAAGTTGCT CAACACTTGG GTGAAAACAC TGTACGTACT 100 

ATTGCTATGG ATGGTACTGA AGGTAAGTTT ATTTTTAGAT GATCATAAAT 150 

AATTGATCAT AATGATAAAA AAAAAAGAAG AAGAAGAACA GGATGTATAT 200 

AATGGTTAAT AAATAATATT TTCATATTGK ATATAACTAT TTAATCTGTT 250 

TTTTTTTCTT CATGATTATA TATATATATG TMCTAATATC TAATATGAAC 300 

PTTTTTTATA AAATTAATCA GGTCTTGTCC GTGGTCAAAA GGTTGTTGAT 350 

ACTGGTGCTC CTATTACCAT TCCTGTTGGT AAGGAAGTTC TTGGTCGTAT 400 

c5?cScGTT ATTGGTGAAC CCATTGATGA ACGTGGTCCT ATTAAGGCCA 450 

StCTCACCG TGCTATTCAC GCTGAAGCTC CAGAATTCGT TGATCAATCT 500 

<50 CCTACTCCCG AAATTC TTGA AACTGGTATT AAGGTTGTCG ATTTATTAGC 550 

TCCTTATGCT CGTGGTGGTA AGATTGGTCT TTTCGGTGGT GCTGGTGTAG 600 

GTAAAACTGT CTTGATTCAA GAACTTATTA ACAACATTGC TAAAGCCCAT 650 

GrTGGTTACT CTATTTTCTG TGGTGTTGGT GAACGTACTC GTGAAGGTAA 700 

CGATTTATAC CACGAAATGA TGGAAACTGG TGTCATTAAA CTTGAAGGTG 750 

55 ACTCCAAGTG TGCTCTTGTA TTCGGTCAAA TGAACGAACC TC CTGGTGCT 80 0 

CGTGCCCGTG TTGCTTTAAC TGGTTTAACC ATTGCTGAAT ACTTCCGTGA 850 

TGAAGAAGGT CAAGATGTGT TACTTTTCAT TGATAACATT TTCCGTTTCA 900 

CTCAAGCTGG TTCTGAAGTA TCTGCCCTTT TAGGTCGTAT TCCATCTGCT 950 

GTAGGTTACC AACCCACTTT ATCTACTGAT ATGGGTGGTA TGCAAGAACG 1000 
60 TATTACTACT ACCAAGAA 



40 



45 
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2) INFORMATION FOR SEQ ID NO: 681 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 681 
GGISSITTYG GIISIGGIAA RAC 23 



15 



20 2) INFORMATION FOR SEQ ID NO: 682 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 682 

GTIACIGGYT CYT CRAARTT ICCICC 2 6 



35 



45 



50 



2 ) INFORMATION FOR SEQ ID NO: 68 3 



(i) SEQUENCE CHARACTERISTICS: 
(A) . LENGTH: 26 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 683 
GTIACIGGIT CISWIAWRTC ICCICC 2 6 

2) INFORMATION FOR SEQ ID NO: 684 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 32 67 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida tropical is 
(C) ACCESSION NUMBER: M64984 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 684 

ATGGCTGGTG CTTTAGAAAA CGCAAGAAAA GAAATTAAAC GTCTTTCATT 50 

agmScacc AATGAATCCC AATATGGTCA AATCTATTCT GTTTCCGGTC 100 

in CGGTTGTTAT TGCCGAAAAC ATGATTGGAT GTGCCATGTA CGAATTGGTT 150 

SScttggtc ATGATAATTT AGTTGGGGAA GTTATTAGAA TTAATGGTGA 200 

SJSSSSSot ATTCAAGTTT ATGAAGAAAC TGCAGGGGTC ACTGTTGGTG 250 

It^AGTTTT AAGAACTGGT AAACCATTAT CTGTTGAATT AGGTCCTGGT 300 

TTAATGGAAA CTATTTATGA TGGTATTCAA AGACCTTTAA AAGCCATTAA 350 

15 AGATGAATCC CAATCTATTT ATATCCCAAG AGGTATTGAT GTTCCTGCTT 400 

TATCAAGAAC TGTTCAATAT GATTTCACTC CAGGTCAATT GAAAGTTGGT 450 

GATCATATCA CTGGTGGGGA CATTTTTGGT TCTATTTATG AAAACTCTTT 500 

ATTGGATGAC CATAAGATTT TGTTACCTCC AAGAGCAAGA GGTACTATTA 550 

CTTCTATTGC TGAAGC CGGT TCTTATAATG TTGAAGAAC C AGTTTTGGAA 600 

2 0 GTTGAATTTG ATGGTAAGAA ACATAAATAC TCTATGATGC ATACATGGCC 650 

AGTTAGAGTT CCAAGACCAG TTGCTGAAAA ATTGACTGCT GATCATCCAT 700 

TGTTGACCGG TCAAAGAGTC TTGGATTCTT TATTCCCATG TGTTCAAGGT 750 

GGTACTACTT GTATCCCAGG GGCTTTTGGT TGTGGTAAAA CTGTTATTTC 800 

TCAATCTTTG TCCAAATTCT CCAACTCTGA TGTTATTATC TATGTTGGTT 850 

2 5 GTTTCACTAA AGGTACTCAA GTCATGATGG CTGATGGTGC CGACAAATCT 900 

ATTGAATCTA TTGAAGTTGG TGACAAAGTC ATGGGTAAAG ATGGTATGCC 950 

AAGAGAAGTT GTTGGCTTAC CAAGAGGTTA TGATGATATG TACAAGGTTC 1000 

GTCAACTTTC TAGTACTAGA CGTAATGCTA AATCCGAAGG CTTGATGGAT 1050 

TTCACTGTTT CTGCTGATCA TAAACTTATC TTGAAAACTA AACAAGATGT 1100 

3 0 CAAGATTGCT ACACGTAAAA TTGGTGGCAA CACCTATACT GGTGTTACTT 1150 

TCTATGTTTT GGAAAAGACT AAGACTGGTA TTGAATTAGT TAAAGC CAAG 1200 

ACTAAAGTTT TCGGTCATCA TATCCATGGT CAAAATGGCG CTGAAGAAAA 1250 

AGCTGCTACT TTTGCTGCTG GCATTGACTC TAAAGAATAC ATTGATTGGA 1300 

TCATTGAAGC TAGAGATTAT GTACAAGTTG ATGAAATTGT CAAGACCAGC 1350 

35 ACCACTCAAA TGATCAACCC AGTTCATTTT GAAT CTGGTA AACTCGGTAA 1400 

CTGGTTACAC GAACACAAGC AAAACAAATC ACTTGCTCCA CAATTGGGTT 1450 

ACTTGTTGGG TACTTGGGCT GGTATTGGAA ATGTTAAATC TTCTGCTTTC 1500 

ACCATGAACT CCAAAGATGA TGTTAAATTA GCTACAAGAA TTATGAACTA 1550 

CTCTTCAAAA TTGGGCATGA CTTGTTCTTC TACTGAATCC GGTGAACTCA 1600 

4 0 ATGTCGCTGA AAACGAAGAA GAATTTTTCA ATAACCTTGG TGCTGAAAAG 1650 

GATGAAGCTG GTGATTTCAC TTTTGATGAA TTTACCGATG CTATGGATGA 1700 

ATTGACTATC AATGTTCATG GTGCAGCTGC AAGCAAGAAG AACAATTTGT 1750 

TGTGGAATGC TTTGAAATCT CTTGGTTTCA GAGCCAAGTC TACTGATATT 1800 

GTCAAGAGTA TTCCTCAACA TATTGCTGTT GATGATATTG TTGT CAGAGA 1850 

45 ATCTTTGATT GCCGGTTTAG TTGATGCTGC TGGTAATGTT GAAACCAAAT 1900 

CCAATGGTTC TATTGAAGCT GTTGTTAGAA CTTCTTTCAG ACATGTCGCT 1950 

AGAGGTCTTG TCAAGATTGC TCATTCTTTG GGTATTGAAT CATCTATTAA 2000 

TATTAAAGAT ACTCACATTG ATGCTGCTGG TGTTAGACAA GAATTTGCTT 2050 

GTATTGTCAA TTTGACTGGT GCTCCACTTG CTGGTGTTCT TTCTAAATGT 2100 

50 GCACTTGCAA GAAACCAAAC TCCAGTTGTC AAATTTACCA GAGACCCAGT 2150 

TTTGTTCAAC TTTGATTTGA TCAAATCTGC AAAAGAAAAC TATTATGGTA 2200 

TTACTTTGGC TGAAGAAACT GATCATCAAT TCCTTTTATC CAACATGGCC 2250 

TTGGTGCACA ACTGTGGTGA ACGTGGTAAT GAGATGGCTG AAGTTTTGAT 2300 

GGAATTCCCA GAATTGTTTA CTGAAATTTC TGGTAGAAAA GAACCAATTA 2350 

55 TGAAACGTAC CACTTTGGTT GC CAATACTT CTAATATGCC AGTCGCTGCC 2400 

AGAGAAGCTT CTATTTATAC TGGTATTACA TTGGCTGAAT ATTTCAGAGA 2450 

TCAAGGTAAG AATGTTTCTA TGATTGCTGA TTCTTCTTCA CGTTGGGCTG 2500 

AAGCTTTGAG AGAAATTTCT GGTAGATTGG GTGAAATGCC TGCTGATCAA 2550 

GGTTTCCCAG CTTATTTGGG TGCTAAATTG GCTTCTTTCT ATGAGCGTGC 2600 

60 CGGTAAAGCC ACTGCTTTGG GTTCACCAGA TAGAGTTGGT TCAGTTTCTA 2650 
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TTGTTGCTGC TGTTTCTCCA GCTGGTGGTG ATTTCTCTGA TCCAGTTACT 27 0 0 

ACTTCTACTT TGGGTATTAC TCAAGTTTTC TGGGGGTTGG ATAAGAAATT 2 750 

GGCCCAAAGA AAACATTTCC CATCTATTAA CACCAGTGTT TCTTATTCTA 2800 

AATACACCAA TGTTTTGAAC AAATACTATG ATTCCAACTA TCCAGAATTC 2850 

5 CCACAATTGA GAGACAAAAT TAGAGAAATT TTATCTAATG CTGAAGAATT 2 90 0 

GGAACAAGTT GTTCAATTAG TTGGTAAATC TGCATTGTCT GATTCTGATA 2 95 0 

AGATTACTTT AGATGTTGCT ACCTTGATTA AAGAAGATTT CTTGCAACAA 3 00 0 

AATGGTTATT CTTCATATGA TGCATTCTGT CCAATTTGGA AGACTTTTGA 3 050 

TATGATGAGA GCATTTATTT CATATTATGA TGAAGCACAA AAAGCAATTG 3100 

10 CCAATGGTGC TCAATGGTCT AAATTAGCTG AAAGTACTAG TGATGTTAAA 3150 

CATGCTGTTT CTTCAGCTAA ATTCTTTGAA CCATCAAGAG GTCAAAAAGA 32 00 

AGGTGAAAAA GAATTTGGAG ATTTATTAAC CACTATCTCC GAAAGATTTG 3250 

CTGAAGCTTC AGAATAA 32 67 



15 



2) INFORMATION FOR SEQ ID NO: 685 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1782 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus hirae 

(B) STRAIN: ATCC 9790 

30 (C) ACCESSION NUMBER: D17462 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 685 

TTGCAAATTG GAAAAATCAT AAAAGTCTCC GGTCCTCTCG TTATGGCAGA 50 

35 AAATATGTCA GAAGCAAGTA TTCAAGACAT GTGTTTAGTG GGAGATTTAG 100 

GAGTCATCGG CGAAATCATT GAGATGCGTC AAGATGTGGC GTCTATTCAA 150 

GTATATGAAG AAACTTCAGG AATTGGTCCC GGAGAAC CTG TTCGTTCCAC 200 

TGGGGAAGCA CTATCTGTTG AGCTAGGACC AGGAATCATT TCACAAATGT 250 

TTGACGGGAT TCAAAGACCA CTGGATACAT TTATGGAAGT GACTCAAAGT 300 

40 AACTTCTTAG GACGTGGGGT CCAATTACCA GCTTTAGATC ATGAGAAACA 3 50 

ATGGTGGTTT GAAGCGACAA TCGAAGAAGG AACAGAAGTA AGTGCTGGAG 400 

ACATCATTGG GTACGTGGAT GAAACGAAGA TCATTCAGCA CAAAATCATG 4 50 

GTCCCTAATG GTATCAAAGG AACTGTACAA AAAATTGAAT CTGGATCATT 5 00 

TACGATCGAT GATCCGATTT GTGTGATCGA AACGGAACAA GGCTTAAAAG 550 

45 AGCTGACGAT GATGCAAAAA TGGCCAGTAC GTCGTGGTCG AC CAATCAAA 60 0 

CAAAAATTAA ATCCAGATGT ACCGATGATC ACCGGTCAAA GGGTCATTGA 650 

CACGTTTTTC CCAGTAACTA AAGGAGGAGC GGCAGCCGTT CCAGGTCCGT 700 

TTGGTGCAGG GAAGACAGTT GTGCAACACC AGATTGCTAA GTGGTCGGAC 75 0 

GTAGATCTAG TGGTTTACGT TGGTTGTGGG GAACGAGGAA ATGAAATGAC 800 

50 GGATGTCGTC AATGAATTTC CTGAACTGAT CGATCCAAAT ACAGGCGAGT 850 

CTTTGATGGA ACGAACTGTG TTGATCGCTA ATACATCGAA CATGCCAGTA 900 

GCTGCTCGAG AAGCTTCTAT TTATACGGGA ATCACGATTG CCGAGTACTT 95 0 

CCGTGACATG GGGTATGATG TAGCAATCAT GGCAGATTCC ACTTCTCGTT 10 00 

GGGCAGAAGC ACTGCGTGAA ATGAGCGGAC GTTTAGAAGA AATGC CTGGT 105 0 

55 GATGAAGGTT ATCCCGCTTA TCTGGGCTCT CGTCTAGCTG AATACTATGA 110 0 

ACGTTCAGGA CGTGTCATTG CTCTAGGCTC TGACCAACGT GAGGGCAGTA 1150 

TCACTGCCAT CAGTGCGGTT TCTCCTTCTG GTGGAGATAT CTCTGAACCA 12 00 

GTGACTCAAA ATAC CTTACG TGTGGTGAAG GTTTT CTGGG GATTAGATTC 12 50 

TAGTCTTGCT CAAAAAAGAC ATTTTCCATC GATTAACTGG ATCCAAAGTT 13 00 

60 ACTCATTATA TTCAACAGAA GTTGGCAGAT ATATGGATCA AATCTTACAA 1350 
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CAGGATTGGT CTGATATGGT 
AGAACAATTA AATGAAATCG 
ATAACGATCG CTTGACCCTT 
TTACAACAAA ATGCTTTTGA 
5 ACAATTCAAC ATGTTGAAAG 
AAGCCTTATC GTTGGGAGCG 
GCGGTCAGAG AACGCATTAG 
AGCCAAAATT AGTAGTATAA 
TTGTTTCAGA AGGAGGGATG 



AACTGAAGGC ATGCGGATCT 
TGCGCTTGGT AGGGATCGAT 
GAAGTAGCAA AATCGATTCG 
TGATGTAGAT ACGTTTACTT 
TTATTTTGAC TTTTGGGAAA 
TACTTCAATG AAATCATGGA 
TCGGAGCAAG TATATTCCAG 
ATGAAGAAAT CAAAGAAACG 
ACCGATGATT AA 



TGCAAGAAGA 14 00 

TCGCTTTCTG 1450 

AGAAGACTAT 1500 

CAAGAGAAAA 1550 

GAAGCTCGAA 1600 

AGGTACAGTA 1650 

AAGAAGAGTT 17 00 

ATC CAATTGA 1750 
1782 



2) INFORMATION FOR SEQ ID NO: 686 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1781 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia pneumoniae 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 686 



CAGGGACATG TTATAGAAGC TTATGGAAAC TTGTTACGTG TACGCTTTGA 50 

CGGATATGTT AGACAAGGTG AAGTTGCATA TGTCAACGTA GATAATACCT 100 

-*0 GGTTAAAAGC AGAAGTGATT GAAGTTGCTG AT CAAGAAGT CAAGGTTCAG 150 

ctatttSag ATACACAAGG CGCGTGTCGA GGAGCTCTTG TTACGTTTTC 200 

AGGAC AT CTT TTAGAAGCCG AGTTAGGGCC TGGCTTGCTT CAGGGCATTT 250 

TCGATGGACT TCAAAATCGT CTTGAGGTGC TAGCTGAAGA TAGTTCTTTC 3 00 

TTGCAGAGAG GCAAGCATGT TAATGC TATT TCTGATCATA ATTTATGGAA 3 50 

-a c TTATACTCCC GTAGCTTCTG TTGGGGATAC TTTAAGACGA GGAGATCTTC 4 00 

TAGGAACAGT ACC TGAAGGA CGATTTACTC ATAAGATTAT GGTTCCTTTT 450 

TCTTGCTTTC AAGAGGTTAC CCTGACTTGG GTAATTTCTG AAGGAAC CTA 500 

TAATGCTCAT ACTGTGGTCG CAAAAGCTCG AGATGCTCAG GGTAAAGAAT 550 

GTGCCTTTAC TATGGTGCAA AGATGGC CGA TCAAACAAGC TTTTATTGAA 600 

4 0 GGAGAGAAGA TCCCTGCGCA TAAGATTATG GATGTGGGTT TGCGAATCTT 650 

AGATACGCAA ATTCCAGTAT TGAAGGGGGG AACTTTCTGT ACCCCAGGAC 700 

CTTTTGGTGC AGGGAAAACA GTCTTACAAC AC C ATCTTTC TAAGTACGCT 750 

GCTGTAGATA TTGTGATTTT GTGTGCGTGC GGAGAGCGTG CTGGTGAAGT 8 00 

TGTTGAGGTA TTACAAGAGT TCCCTCATCT TATCGACCCC CATAC CGGAA 850 

45 AGTCTTTAAT GCACAGAACA TGTATTATTT GTAACACATC ATCCATGCCT 900 

GTGGCTGCCC GAGAGTCTTC GATCTATTTA GGAGTGACGA TTGCAGAATA 950 

CTATCGCCAG ATGGGACTAG ATATTCTGCT TTTAGCTGAT TCTACATCCC 100 0 

GATGGGCACA AGC CCTTAGA GAGATTT CGG GACGTCTTGA AGAAATCCCT 10 50 

GGAGAGGAAG CATTTCCTGC ATACCTGTCT TCTAGAATAG CTGCTTTTTA 1100 

50 TGAGCGAGGA GGAGCTATCA CCACGAAAGA TGGTTCTGAA GGATCTTTAA 1150 

CTATATGTGG TGCGGTGTCT CCTGCAGGAG GAAACTTTGA AGAACCAGTC 12 0 0 

ACTCAATCTA CATTAGCTGT AGTCGGAGCG TTCTGTGGTC TTTCAAAAGC 12 50 

ACGAGCTGAC GCACGTAGGT ATCCTTCAAT AGACCCTTTG ATTTCTTGGT 1300 

CAAAATATTT GAAC CAGGTA GGACAAATTT TAGAAGAGAA GGTTTCAGGC 1350 

5 5 TGGGGTGGTG CTGTGAAAAA AG C AG C A CAG TTTCTAGAGA AAGGTTCAGA 14 0 0 

AAT CGGCAAG CGTATGGAAG TTGTCAGTGA AGAAGGGGTT TCTATGGAAG 1450 

ACATGGAAAT CTACTTAAAG GCAGAAC TTT ATGATTTTTG TkATCTCCAG 150 0 

CAGAACGCAT TCGATCCTGT GGACTGTTAT TGTCCTTTTG AGAGACAGAT 1550 

AGAGTTATTT TCATTAATCA GTCGTATTTT TGATGCTAAA TTTGTTTTTG 16 0 0 

60 ATAGTCCTGA TGATGCAAGA AGCTTTTTCC TTGAGCTGCA GAGCAAGATT 1650 
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AAGACATTAA ATGGCCTGAA ATTTCTTTCA GAGGAATATC ATGAGAGTAA 1700 
AGAGGTCATA GTTAGACTGT TGGAAAAAAC AATGGTACAA ATGGCGTAAG 1750 
GATATGCAAA CAATCTACAC AAAAATAACT G 17 81 



2) INFORMATION FOR SEQ ID NO: 687 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 1758 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Halobacterium salinarum 
(C) ACCESSION NUMBER: S56356 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 687 



ATGAGTCAAG CTGAAGCAAT CACTGACACC GGCGAAATCG AGAGCGTGAG 50 

CGGTCCGGTC GTGACCGCCA CGGGCCTCGA CGCGCAGATG AACGACGTCG 100 

25 TCTACGTGGG CGACGAGGGT CTGATGGGCG AGGTCATCGA GATCGAAGGC 150 

GACGTAACCA CCATC CAGGT CTACGAGGAG ACGTCCGGCA TCGGGCCGGG 2 00 

CCAGCCCGTC GACAACACGG GCGAACCGCT CACCGTGGAC CTGGGCCCGG 2 50 

GGATG CTGGA CTCCATCTAC GATGGTGTGC AGCGTCCACT GGACGTCCTC 3 00 

GAAGACGAGA TGGGGG CGTT CCTCGATCGC GGTGTTGACG CACCCGGCAT 3 50 

30 CGACCTCGAC ACCGACTGGG AGTTCGAGCC CACCGTCGAG GCGGGCGACG 4 00 

AGGTCGCGGC CGGCGATGTC GTCGGCACCG TCGACGAAAC GGTCAGCATC 450 

GAACACAAGG TTCTGGTGCC CCCCCGCAGT GACGGCGGCG AAGTCGTCGC 50 0 

CGTCGAATCC GGCACGTTCA CCGTCGACGA CACGGTCGTC GAGTTGGACA 550 

CCGGCGAGGA GATCCAGATG C AC CAGGAGT GGCCGGTCCG CCGCCAGCGC 600 

3 5 CCCACCGTCG ACAAGCAGAC GCCGACGGAG CCGCTGGTGT CCGGCCAGCG 650 

CATCCTCGAC GGCCTGTTCC CGATCGCGAA AGGCGGGACG GCCGCGATCC 7 00 

CGGGGCCGTT CGGGTCCGGG AAGACGGTCA CCCAGCAGTC CCTCGCGAAG 750 

TTCGCCGACG CGGACATCGT TGTCTACATC GGCTGTGGTG AGCGCGGCAA 80 0 

CGAGATGACG GAAGTCATCG AGGACTTCCC GGAGCTGCCC GACCCCCAGA 850 

4 0 CCGGGAACCC GCTGATGGCC CGCACCACGC TCATCGCGAA CACGTCGAAC 90 0 

ATGCCGGTTG CCGGGCGTGA GTCCTGCATT TACACGGGAA TCACCATCGC 950 

GGAGTACTAC CGCGACATGG GCTACGACGT GGCGCTGATG GCCGACTCCA 10 00 

CCTCGCGGTG GGCGGAGGCC ATG CGGGAGA TCTCCTCGCG ACTCGAGGAG 1050 

ATGCCCGGCG AGGAGGGGTA TCCCGCGTAC CTGGCCGCCC GCCTCTCGGA 1100 

4 5 GTTCTACGAG CGCGCCGGCT ACTT CGAGAA CTTCAACGGG ACCGAGGGCT 1150 

CCATCTCGGT CATCGGTGCG GTGTCGCCGC CCGGCGGGGA CTTCTCCGAG 12 00 

CCGGTCACCC AGAACACGCT GCGCATCGTG AAGACGTTCT GGGCGCTTGA 12 50 

CTCGGACCTC GCCGAGCGCC GGCACTTCCC GGCGATCAAC TGGGACGAGT 13 00 

CCTACAGCCT CTACAAGGAC CAACTCGACC CGTGGTTCAC GGACAACGTC 13 50 

5 0 GTCGACGACT GGGCCGAGCA GCGCCAGTCG GCGGTCGACA TCCTCGACGA 14 00 

GGAATCCGAA CTCGAAGAGA TCGTGCAGCT CGTCGGGAAG GACGCGCTGC 14 50 

CCGAGGACCA GCAGCTCACG CTGGAAGTCG CGCGGTACAT CCGCGAGGCG 1500 

TGGCTCCAGC AGAACGCGCT CCACGACGTG GATCGCTACT GCCCGCCCGA 15 50 

GAAGACGTAC GCCATCCTCT CCGGCATCAA GACGCTTCAC GAGGAGTCCT 16 0 0 

55 TTGAGGCGTT GGACGC CGGT GTGCCAGTCG AGGAGATCAC GTCGATCGAC 16 50 

GCCGCGCCGC GCCTGAATCG TCTCGGCACG ACGCCCGACG ACGAGCACGA 17 0 0 

GGCGGAGGTC GCGGAGATCA AACAGCAGAT TAC CGAGCAG CTTCGGGAGC 1750 

TCTACTGA 1758 

60 
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2) INFORMATION FOR SEQ ID NO: 688 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 3118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Human 

(C) ACCESSION NUMBER: L09234 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 88 



GAATTCCGGC AGCTGACTAG TCTTGTGATT GGGGTCCTGG GCTGATAAAA 50 

TCATTCCAAA TGACGAGCAC ATTGATAAAG ACGTCCGATG AGGACCGGGA 100 

9fi GTCCAAATTC GGCTTTGTTT TTGCCGTATC TGGACCTGTG GTGACAGCTG 150 

AACGAATGGC CGGTTCTGCT ATGTACGAAC TGGTGCGTGT CGGTTATTAT 2 00 

GAACTGGTCG GAGAGATCAT CCGGTTGGAG GGTGACATGG CAACAATCCA 2 50 

AGTATACGAA GACAC CTC AG GTGTGACAGT AGGCGATCCC GTGCTGCGCA 3 00 

CAGGCAAGCC GCTGTCCGTG GAACTGGGAC CCGGAATCAT GGGCAGCATC 3 50 

2 5 TT CGACGGT A TCCAGCGACC GCTGAAGGAT ATCAATGAAC TGTCAAATAG 4 00 

TATCTACATC CCGAAAGGTG TCAATGTGCC TGCCCTGAGT CGCACTGCAC 4 50 

AGTGGGAC TT CAGTCCCGTC AGTGTCAAGG TTGGAAGCCA CATTACTGGT 500 

GGTGACCTGT ACGGTTTGGT CCACGAAAAT ACTCTGGTGA AACACAAGTT 550 

GCTGCTGCCG CCCCGTGCCA AGGGAACTGT CACGTACATT GCAGAACCTG 600 

10 GAAACTACAC AGTTGATGAT GTTGTCCTGG AGACAGAATT TGACGGCGAG . 650 

CGATCAAAGT TCACCATGCT GCAAGTGTGG CCTGTACGTC AGCCCAGGCC 700 

TGTTACAGAA AAGTTGCCAG CTAACTACCC CCTCCTTACT GGCCAGCGTG 750 

TGCT CGACTC CCTATTCCCG TGTGTCCAGG GTGGAACAAC AGCTATTCCT 800 

GGGGCCTTCG GATGTGGCAA GACTGTAATA TCACAGTCTT TGTCAAAATA 850 

3 5 CTCAAACTCC GATGTAATTA TCTATGTAGG TTGTGGTGAG CGAGGTAATG 90 0 
J ~ AAATGTCAGA AGTACTCAGG GATTTCCCGC AGTTGTCGTT GGAGATTGAT 95 0 

GGTGTGACTG AAT CAATCAT GAAGAGAACA GCCCTGGTCG CAAACACATC 10 00 

AAACATGCCT GTGGCTGCTC GAGAAGCATC TATCTACACA GGTATTACAC 105 0 

TGTCAGAATA CTT CAGGGAC ATGGGTTACA ATGTATCCAT GATGGCTGAC 1100 

4 0 TCAACTTCAC GATGGGCCGA AGCTCTTCGA GAAATCTCAG GTCGATTGGC 115 0 

TGAAATGCCT GCCGACAGCG GTTATCCCGC CTACCTAGGT GCACGACTTG 120 0 

CCAGTTTCTA CGAGCGTGCC GGCCGTGTGA AGTGCTTGGG TAAC CCAGAC 1250 

AGGGAGGGCT CCGTGAGTAT AGTGGGCGCC GTGTCGCCGC CCGGTGGAGA 13 0 0 

CTTCTCAGAT CCCGTGACGA CGGCCACACT AGGTATCGTC CAGGTGTTCT 1350 

4 5 GGGGTCTCGA CAAGAAACTT GCCCAGCGAA AGCACTTCCC ATCCATCAAC 140 0 

TGGCTCATCT CGTACAGTAA-* ATACATGCGT GCTCTGGATG ACTTCTACGA 1450. 

CAAGAATTTC CCAGAGTTTG TCCCACTGCG TACAAAGGTG AAGGAGATTT 1500 

TGCAGGAGGA AGAAGAC CTG TCTGAAATTG TGCAGTTGGT CGGTAAAGCT 1550 

TCATTGGCAG AAACTGACAA GATCACACTT GAGGTTGCCA AACTATTAAA 1600 

50 GGATGATTTC CTGCAACAGA ACAGCTATTC ACCATATGAC CGTTTCTGCC 1650 

CATTCTACAA GACAGTAGGA ATGCTGAAAA ATATGATTGC TTTCTACGAT 170 0 

ATGTCTCGGC ATGCAGTTGA ATCTACTGCT CAGAGCGAGA ACAAGATCAC 1750 

TTGGAATGTT ATTAGAGATT CTATGGGCAA TATT CTGTAT CAGCTTTCCT 1800 

CCATGAAATT CAAGGATCCA GTCAAGGATG GAGAAGCGAA GATCAAGGCA 1850 

55 GACTTTGAGC AGCTTCATGA AGACATT CAG CAAGCCTTCA GGAAC CTGGA 190 0 

GGATTAAAGT GGTAGCTGCC AGTGGTTCTC TCGGTGCAGT TGTCACATTT 1950 

GGCAAGCTCT GTAGGGTTGC CGAGTGGCAT CGGTGCTAGA CACCTGAGCA 2 000 

TTCCTTTGCC ACATAAAGAC TAAAGCAGGT GGAATTTCAG TTGTAAAAAG 2 050 

CTGGTTCCAT TGGTGCTAAG ATTATGTTGT GCCCTTTTCT GCTTCTCACA 2100 

60 TTCCAACAGA GGAATTTACT TCCAGTTTTC TTCCATTTTC CTCCTCATTT 2150 
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TAAGTGTCGG 
AAGCAGGGAG 
CAGCCTTGTA 
ATTGCCGTCT 
5 CAGAAGGTGA 
AACTGCCATC 
TCCAGTGTTG 
ACCTCACATT 
TGTAGAAAAA 

10 TTTAAAATGG 
TGCATTTTGC 
ACAAATTATG 
TGC CAGTATT 
TTTTTTTTCC 

15 ATGTGTAAAA 
TGACTGTATA 
AAGGTGAAAG 
AAAATCCTAC 
GTGTAGAAAT 

2 0 ATAAATTATG 



TACAGAGGCA 
AAT TTGTAAT 
AATTCATTTG 
GTATGTATGT 
ATTACACCAC 
CATTGTCCTA 
CAAATTGTGT 
GTAACTCAGT 
ATAATTGTTG 
TGAATGTATT 
TTAATGGAGA 
TATATGTGAA 
AATTTAAACA 
TATAAAATTG 
TAGTGATCCC 
GTTGTACAGG 
TCTATTATTG 
TTCTGTGTAT 
ACAGTCAACT 
CCCGGAAT 



ATAATCTGAT 
TATTACAAAT 
TCCCAGGACT 
ACACACCGTA 
TTACTCATTG 
TTTATTCACA 
TTAGAAAATT 
ATTTACACAC 
TATATTGAAA 
TTATATTTCT 
TGTATGTAAA 
ATGGAGATGG 
ACTGTAGTTG 
GTGGATGTAT 
AGTAACTGTA 
TGTTGTTACT 
TATTGTAATG 
AAATGTTACC 
TCATGTTCAT 



AACTCTGTAC 
CCCATTATCT 
CCCTCTTGTG 
CTGCAGTATT 
TGTCACGTAG 
TAACTAGTTT 
ATGCCATCGA 
ACGTTTACTT 
GTACAAGTGA 
TTTGTAGACA 
CCTAAAATAG 
TTTCTAATTT 
TGGGATGTAG 
GTGTCGGAGA 
AAGCTTTAGA 
TTTAAGAATT 
TTTAAAGCAT 
ATTCCTCATA 
TAGCATTTCA 



CGTCACTTAC 220 0 

CTGTGCACCA 22 50 

TGTACGTGAG 23 0 0 

TGAAGTCAGT 2 3 50 

CAAGTGTGCA 24 00 

TCTTTGCATT 2450 

GACTGGTCGA 2500 

GCTACAGAAA 2 550 

CAAAGTTGCA 2600 

CAAGAGTTAA 2650 

GAGTTTGTGC 2 70 0 

GCTGATTGAT 2750 

TGGGAAGATT 2 8 0 0 

TTTTGATTGT 2 8 50 

ATACAGTTAC 2900 

TATTGACACA 2 950 

TTAAGGTTTA 3 000 

TAACATAACT 3 050 

CTGTTGTCAC 3100 
3118 



2) INFORMATION FOR SEQ ID NO: 689 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 183 6 bases 

(B) TYPE: Nucleic acid 

(C) STRAND EDNESS : Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
3 5 (A) ORGANISM: Plasmodium falciparum 

(B) STRAIN: 3D7 

(C) ACCESSION NUMBER: L08200 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 689 



ATGACAAAAG TTGCTGTTGA AAAAGAGGAA CCAGGAGTTG TTTATAAGGT 50 

GGCTGGTTCA TTAGTTATTG CTGAGAATAT GAGTGGAACT CGTATGTACG 100 

AGTTAG CTAA AGTAGGATGG AATAAATTGG TTGGAGAAAT TATTAGATTA 150 

GAAGGGAATT ATGCATATAT ACAAGTTTAT GAAGATACTT CAGGTTTATC 200 

45 TGTAGGAGAC CCTGTTATAA AAACAGGAAA TGCTTTATCA GTCGAATTGG 250 

GTCCTGGTAT TTTAGATAAT ATTTATGATG GTATTCAAAG AC CATTAGAA 300 

AGAATAGCAA ATGTGTGTGG TGATGTATAT ATATATAAAG GTATTGATAT 350 

GACATCTTTA GATCATGATA AACAATGGCA ATTTTATGCT GATAAGAAAT 4 00 

TAAAATTAAA TGATATTGTT ACTGGTGGAG ATATCTTTGG ATTTGTTGAT 4 50 

50 GAAAATAAAT TATTTAAAGA ACACAAAATT ATGGCTCCAC CTAATGCTAA 500 

AGGGAGGCTT ACATATATTG CTCCAGATGG ATCATATACT TTAAAAGATA 550 

AAATATTTGA ATTAGAATAT CAAGGAAAAA AATATACATA TGGTTTATCT 600 

CATTTATGGC CTGTTCGTGA TCCTAGACCT GTTTTAGAAA AGGTAACAGG 650 

GGATACTTTA TTATTAACAG GGCAAAGAGT TTTAGATTCG TTATTTCCAA 700 

55 CAGTTCAAGG AGGTACTTGT GCTATTCCTG GTGCATTTGG TTGTGGAAAA 750 

ACTTGTGTTT CTCAGGCCTT ATCAAAATAT TCTAATAGTG AAGTTATTAT 80 0 

ATATGTAGGA TGTGGTGAAA GAGGTAATGA AATGGCTGAA ATTTTATCCG 850 

ACTTTCCTGA ATTAACTACT AAAGTAGATA ATGAAGATGT AGGTATTATG 90 0 

CAAAGAACGT GTTTAGTTGC TAATACTTCT AACATGCCTG TCGCTGCAAG 950 

60 AGAAGCTAGT ATTTATACAG GTATTACTTT ATGTGAATAT TTCCGTGATA 1000 
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TGGGTTATAA TGCTACCATG ATGGCTGATA 
GCCTTAAGAG AAATTTCAGG ACGTTTAGCT 
TTATCCAGCT TATTTAGGTG CTAGATTAGC 
GAAAAGTCAA ATGTATTGGT TCTCCATCTC 
5 GTGGGTGCTG TGTCTCCACC AGGTGGTGAT 
AGCAACCATG TCTATTGTTC AAGCATTTTG 
CTCAAAGAAA ACATTTCCCT TCTGTTAATT 
TATGTCAGAC AATTAGAACA ATACTTTGAT 
ATC TTTAAGA CAAAAAATTA GTGATATTTT 

10 ATGAT AT TGT TCAACTAGTA GGAAAGGATT 
GTTGTTATGG AAGTAGCCAA AATTATTAGA 
TGCATTTAGC GATTATGATT ATATGTGCCC 
TGATGAGAAT TATTTGCCAC TTTTATGCTC 
GAATATGACT CAAGAGAAAG AAAAATTGGT 

15 ATTAAGACCA ACTATAAATA AAATTACACA 
AAAATTCAGA TGAATATTTC AAAAAGTATT 
ATAACAGTAG GTTTAAGAAA CTTGATGGAA 



GTACAAGTAG ATGGGCAGAA 1050 

GAAATG CCTG CAGATAGTGG 1100 

TTCCTTTTAT GAACGTGCAG 115 0 

GTATAGGATC CATTACAATT 12 0 0 

TTCTCTGACC CTGTAACTAC 12 50 

GGGGTTAGAT AAAAAACTAG 13 00 

GGTCTACATC CTTTTCAAAG 13 50 

AATTTTGATC AAGATTTCTT 14 00 

ACAACAAGAA AGTGACTTGA 14 50 

CATTATCAGA AGACCAAAAA 15 00 

GAAGATTTTC TTCAACAAAA 1550 

ATTACAAAAA ACAGTTGGTA 16 00 

AATGCTTAAG AACATTACAA 1650 

TGGGGATCTA TATATAATAC 17 0 0 

TATGAAATTT GAAAACCCAA 1750 

TTAAGGCACT TGAAGAAGAA 18 0 0 

AAATGA 1836 



20 



2 ) INFORMATION FOR SEQ ID NO : 690 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 216 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



35 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Saccharomyces cerevislae 

(B) STRAIN: X2180-1A 

(C) ACCESSION NUMBER: J05409 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 690 



ATGGCTGGTG CAATTGAAAA CGCTCGTAAG GAAATAAAAA GAATCTCATT 50 

AGAAGACCAT GCTGAATCTG AATATGGTGC CATCTATTCT GTCTCTGGTC 100 

40 CGGTCGTCAT TGCTGAAAAT ATGATTGGTT GTGC CATGTA CGAATTGGTC 150 

AAGGTCGGTC ACGATAACCT GGTGGGTGAA GTCATTAGAA TTGACGGTGA 200 

CAAGGCCACC ATC CAAGTTT ACGAAGAAAC TGCAGGCCTT ACGGTCGGTG 2 50 

ACCCTGTTTT GAGAACAGGT AAGCCTCTGT CGGTAGAATT GGGTCCTGGT 3 00 

CTGATGGAAA CCATTTACGA TGGTATTCAA AGACCTTTGA AAGCCATTAA 350 

45 GGAAGAATCG CAATCGATTT ATATCCCAAG AGGTATTGAC ACT C CAGCTT 400 

TGGATAGGAC TATCAAGTGG CAATTTACTC CGGGAAAGTT TCAAGTCGGC 4 50 

GAT CATATTT CCGGTGGTGA TATTTACGGT TCCGTTTTTG AGAATTCGCT 50 0 

AATTTCAAGC CATAAGATTC TTTTGCCACC AAGATCAAGA GGTACAATCA 550 

CTTGGATTGC TCCAGCTGGT GAGTACACTT TGGATGAGAA GATTTTGGAA 600 

50 GTTGAATTTG ATGGCAAGAA GTCTGATTTC ACTCTTTACC ATACTTGGCC 650 

TGTTCGTGTT CCAAGACCAG TTACTGAAAA GTTATCTGCT GACTATCCTT 70 0 

TGTTAACAGG TCAAAGAGTT TTGGATGCTT TGTTTCCTTG TGTTCAAGGT 750 

GGTACGACAT GTATTCCAGG TGCTTTTGGT TGTGGTAAGA CCGTTATCTC 80 0 

TCAATCTTTG TCCAAGTACT CCAATTCTGA CGC CATTATC TATGTCGGGT 85 0 

55 GCTTTGCCAA GGGTACCAAT GTTTTAATGG CGGATGGGTC TATTGAATGT 900 

ATTGAAAACA TTGAGGTTGG TAATAAGGTC ATGGGTAAAG ATGGCAGACC 950 

TCGTGAGGTA ATTAAATTGC CCAGAGGAAG AGAAACTATG TACAGCGTCG 1000 

TGCAGAAAAG TCAGCACAGA GCCCACAAAA GTGACTCAAG TCGTGAAGTG 1050 

CCAGAATTAC TCAAGTTTAC GTGTAATGCG ACCCATGAGT TGGTTGTTAG 1100 

6 0 AACACCTCGT AGTGTCCGCC GTTTGTCTCG TACCATTAAG GGTGTCGAAT 1150 
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ATTTTGAAGT TATTACTTTT GAGATGGGCC AAAAGAAAGC CCCCGACGGT 12 00 

AGAATTGTTG AGCTTGTCAA GGAAGTTTCA AAGAGCTACC CAATATCTGA 1250 

GGGGC CTGAG AGAGCCAACG AATTAGTAGA ATC CTATAGA AAGGCTTCAA 1300 

ATAAAGCTTA TTTTGAGTGG ACTATTGAGG CCAGAGATCT TTCTCTGTTG 13 50 

5 GGTTCCCATG TTCGTAAAGC TACCTACCAG ACTTACGCTC CAATTCTTTA 14 00 

TGAGAATGAC CACTTTTTCG ACTACATGCA AAAAAGTAAG TTTCATCTCA 14 50 

CCATTGAAGG TCCAAAAGTA CTTGCTTATT TACTTGGTTT ATGGATTGGT 1500 

GATGGATTGT CTGACAGGGC AACTTTTTCG GTTGATTCCA GAGATACTTC 1550 

TTTGATGGAA CGTGTTACTG AATATGCTGA AAAGTTGAAT TTGTGCGCCG 160 0 

10 AGTATAAGGA CAGAAAAGAA CCACAAGTTG CCAAAACTGT TAATTTGTAC 1650 

TCTAAAGTTG TCAGAGGTAA TGGTATTCGC AATAATCTTA ATACTGAGAA 1700 

TCCATTATGG GACGCTATTG TTGGCTTAGG ATTCTTGAAG GACGGTGTCA 1750 

AAAATATTCC TTCTTTCTTG TCTACGGACA ATATCGGTAC TCGTGAAACA 18 00 

TTTCTTGCTG GTCTAATTGA TTCTGATGGC TATGTTACTG ATGAGCATGG 1850 

15 TATTAAAGCA ACAATAAAGA CAATTCATAC TTCTGTCAGA GATGGTTTGG 1900 

TTTCCCTTGC TCGTTCTTTA GGCTTAGTAG TCTCGGTTAA CGCAGAACCT 1950 

GCTAAGGTTG ACATGAATGG CACCAAACAT AAAATTAGTT ATGCTATTTA 2000 

TATGTCTGGT GGAGATGTTT TGCTTAACGT TCTTTCGAAG TGTGCCGGCT 2050 

CTAAAAAATT CAGGCCTGCT CCCGCCGCTG CTTTTGCACG TGAGTGCCGC 2100 

2 0 GGATTTTATT TCGAGTTACA AGAATTGAAG GAAGACGATT ATTATGGGAT 2150 

TACTTTATCT GATGATTCTG ATCATCAGTT TTTGCTTGCC AACCAGGTTG 22 0 0 

TCGTC CATAA TTG CGGAGAA AGAGGTAATG AAATGGCAGA AGTCTTGATG 2 250 

GAATTCCCAG AGTTATATAC TGAAATGAGC GGTACTAAAG AACCAATTAT 23 00 

GAAGCGTACT ACTTTGGTCG CTAATACATC TAACATGCCG GTTGCAGCCA 23 50 

25 GAGAAGCTTC TATTTACACT GGTATCACTC TTGCAGAATA CTTCAGAGAT 24 0 0 

CAAGGTAAAA ATGTTTCTAT GATTGCAGAC TCTTCTTCAA GATGGGCTGA 24 50 

AGCTTTGAGA GAAATTTCTG GTCGTTTGGG TGAGATGCCT GCTGATCAAG 2500 

GTTTCCCAGC TTATTTGGGT GCTAAGTTGG CCTCCTTTTA CGAAAGAGCC 2 550 

GGTAAAGCTG TTGCTTTAGG TTCCCCAGAT CGTACTGGTT CCGTTTCCAT 2 600 

30 CGTTGCTGCC GTTTCGCCAG CCGGTGGTGA TTTCTCAGAT CCTGTTACTA 2 650 

CTGCTACATT GGGTATCACT CAAGTCTTTT GGGGTTTAGA CAAGAAATTG 2 700 

GCTCAAAGAA AGCATTTCCC ATCTATCAAC ACATCTGTTT CTTACTCCAA 2 750 

ATACACTAAT GTCTTGAACA AGTTTTATGA TTCCAATTAC CCTGAATTTC 28 00 

CTGTTTTAAG AGATCGTATG AAGGAAATTC TATCAAACGC TGAAGAATTA 2 850 

35 GAACAAGTTG TTCAATTAGT TGGTAAATCG GCCTTGTCTG ATAGTGATAA 2 9 00 

GATTACTTTG GATGTTGCCA CTTTAATCAA GGAAGATTTC TTGCAACAAA 2 950 

ATGGTTACTC CACTTATGAT GCTTTCTGTC CAATTTGGAA GACATTTGAT 3 0 00 

ATGATGAGAG CCTTCATCTC GTATCATGAC GAAGCTCAAA AAGCTGTTGC 3 0 50 

TAATGGTGCC AACTGGTCAA AACTAGCTGA CTCTACTGGT GACGTTAAGC 3100 

40 ATGCCGTTTC TTCATCTAAA TTTTTTGAAC CAAGCAGGGG TGAAAAGGAA 3150 

GTCCATGGCG AATTCGAAAA ATTGTTGAGC AC TATGCAAG AAAGATTTGC 3200 

TGAATCTACC GATTAA 3216 



45 



55 



60 



2) INFORMATION FOR SEQ ID NO: 691 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 60 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE : Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Schlzoaaccharomycea pombe 

(B) STRAIN: 972 h- 

(C) ACCESSION NUMBER: S47814 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 691 



ATGGCGGGAG 
TTACGACGAG 
5 CTGTCGTTGT 
CGCGTTGGTC 
TAAATGTACT 
ATCCTGTCCA 
TTAGCTGAGA 

10 CGACAAATCT 
TTAATCGTGA 
GGCGATCATG 
TCTTTTCAAT 
TCACATATAT 

15 GAAGTCGAGT 
GCCTGTCCGT 
CTTTATTGAC 
GGTGGCACTA 
TTCACAATCT 

2 0 GTTGTGGTGA 
GAACTAACAA 
TACATTGGTA 
CCATTTATAC 
AACGTTTCAA 

2 5 TGAAATTTCT 

CTTATTTGGG 
CGTTGCTTGG 
TGTTTCTCCT 
TGGGAATTGT 
30 AAACACTTTC 
TGCTTTGCAA 
GTGAT CAAAT 
ATTCAGTTGG 
GGACATAGCC 

3 5 CTGATTACGA 

AACATGATTG 
CGTTCCTTGG 
TAACCTCGAT 
GAACACTATG 

4 0 GAC TGAGTAA 



GAATTGAACT 
CATGAAAACC 
TGCAGCCAAT 
ATGAAGAACT 
ATTCAAGTAT 
ACGCACTGGA 
CTATTTATGA 
CAAAGTATTT 
GCATAAGTGG 
TATCCGGTGG 
GATCATAAAA 
TGCTGAAGCT 
TTAATGGCAA 
GCTGCTCGTC 
TGGTCAACGT 
CTGCTATCCC 
CTTTCTAAGT 
AC GTGGAAAC 
TTGATATTAA 
GCCAACACTT 
CGGTATTACA 
TGATGGCTGA 
GGTCGTTTGG 
TGC CAAATTG 
GAAGTC CTGA 
CCGGGTGGTG 
TCAAGTCTTC 
CCTCAATCAA 
CCTTGGTATG 
CAAACAGATC 
TTGGTAAGTC 
GGTATTATTA 
TCGCTGTTGC 
CTTACTACAC 
TCAAAGATTA 
GAAATTCGAA 
AAACTCTGCA 



GGCCAAGAAG 
GATATGGATC 
ATGCTTGGAT 
AGTTGGTGAA 
AC GAAGAG AC 
AAGCCATTAT 
TGGTATCCAA 
ATATTCCTAG 
GATTTCACAC 
TGATGTTTTC 
TTATGTTACC 
GGATCATACC 
GAAACATTCT 
CAGTTGCGGA 
GTTTTGGATG 
CGGTGCCTTT 
ACTCTAATTC 
GAAATGGCAG 
TGGTAAACCA 
CTAACATGCC 
CTTGCTGAAT 
TTCTACATCT 
CTGAGATGCC 
GCTTCTTTTT 
CCGTGAAGGA 
ATTTTTCTGA 
TGGGGTTTGG 
CACCTCTCTT 
AGGAAAGAGT 
ATTCAACAAG 
GGCTCTTTCT 
AGAATGACTT 
CCTCTTTACA 
AAAGGC TAAA 
AAGAAAGTAC 
AACCCTAATG 
CAAGAAGATT 



GCTATCAGGA 
TATTTTCAGC 
GTTCGATGTA 
GTAATTCGTA 
GTCCGGTCTC 
CTGTTGAATT 
CGTCCGTTAA 
AGGTATTAAT 
CAAATAAGGA 
GGTTCTGTAT 
CCCTAGAGCC 
ATGTTGATGA 
TTTAGTATGT 
CAACTTAACT 
CGTTATACCC 
GGTTGTGGTA 
TGATTTGATT 
AAGTGTTAAT 
GAGCCCATTA 
TGTCGCTGCT 
ATTATCGTGA 
CGTTGGGCTG 
TGCCGATTCT 
ACGAACGTGC 
ACAGTTT CAA 
TCCTGTTACT 
ACAAGAAATT 
TCCTATTCTA 
TCCAGGCTTT 
AAGATTC CAT 
GAAACGGATA 
CTTACAACAA 
AGACTTATCA 
AGTGCCGTTG 
TTCAGATATC 
AAGGCGAGAA 
GAGGACAAGT 



GCCTCAAAAA 5 0 

GTTTCTGGTC 100 

CGAACTCGTT 150 

TCCATCAAGA 2 00 

ACTGTTGGTG 250 

AGGTCCTGGT 3 00 

AGCAAATTTT 35 0 

ACAGAATCAC 400 

TTTACGCATT 4 50 

TTGAAAACTC 50 0 

CGTGGTACCG 550 

AAAACTTCTT 600 

TGCATACTTG 650 

GCTAATCAAC 7 00 

CTGTGTTCAA 750 

AAACAGTTAT 800 

GTTTACGTCG 850 

GGATTTCCCA 90 0 

TGAAGCGTAC 950 

CGTGAAGCTT 1000 

TCAAGGTAAG 10 50 

AAGCTTTGCG 1100 

GGTTATCCCG 1150 

TGGTCGTGCT 1200 

TTGTTGGAGC 12 5 0 

AGTGCAACCT 130 0 

GGCCCAACGT 13 50 

AATACATCAA 14 0 0 

AATACTCTTC 14 50 

GTTGGAAATT 1500 

AAGTTACTTT 1550 

AACGGTTATT 1600 

TATGATGCGA 1650 

AAACTGGTAG 1700 

TTTTATGAGT 1750 

GGAAATAGTC 1800 

TTCACACTCT 18 50 
1860 



2) INFORMATION FOR SEQ ID NO: 692 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma congolense 

(B) STRAIN: IL3000 

(C) ACCESSION NUMBER: Z25814 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 692 
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ATGACGAGCG ATAAAAACCC TTACAAAACA GAGCAGCGCA TGGGGGCCGT 5 0 

GAAGGCCGTC TCCGGGCCAG TTGTCATTGC TGAAAACATG GGCGGTAGCG 10 0 

CTATGTATGA GCTTGTGCAG GTAGGTTCCT TCCGGTTAGT GGGCGAGATC 15 0 

ATTCGTCTAG AGGGCGATAC CGCCACTATT CAGGTCTATG AGGAAACAGG 20 0 

5 TGGCCTCACT GTCGGAGACC CGGTGTACTG TACGGGTAAG CCTCTTTCGC 250 

TTGAGCTTGG ACCTGGAATC ATGTCTGAAA TATTTGACGG T AT CCAGCGG 3 00 

CCTCTTGACA CCATCTACCG CATGGTGGAA AACGTGTTTA TCCCCAGGGG 35 0 

CGTTCAGGTG AAGTCACTCA ATGAC CAGAA ACAGTGGGAC TTTAAG CCAT 4 00 

GCCTGAAGGT TGGAGATCTT GTGTCTGGTG GTGATATCAT TGGCTCAGTG 450 

10 GTGGAGAACT CTCTCATGTA CAATCACAGC ATTATGATTC CGCCCAATGT 50 0 

GCGGGGCCGT GTTACTTCGA TTGTTCCTTC AGGAAATTAC ACCCTCCAAG 550 

ATGACATTAT TGAATTGGAA TATAATGGGA CAGTGAAATC ACTAAAACTT 600 

ATGCATCGCT GGCCAGTACG GACCCCGCGT CCTGTGGCGT CAAAAGAATC 650 

CGGCAAT CAT CCGCTTCTCA CCGGACAGCG TGTGCTCGAT GCTCTCTTTC 700 

15 CATCCGTCCA GGGTGGAACA TGCGCCATCC CTGGCGCGTT TGGATGCGGA 750 

AAGACGGTTA TCAGTCAGGC TCTTTCGAAG TTCTCCAACA GCGACGCTGT 800 

TATCTATGTC GGCTGCGGCG AGCGTGGGAA TGAGATGGCA GAGGTGCTCA 850 

TGGACTTCCC CACACTCACC ACCGTTATTG ATGGTCGTGA GGAGTCCATC 900 

ATGAAGCGTA CCTGCCTGGT GGCAAACACC T CAAATATGC CTGTCGCTGC 950 

20 TCGTGAGGCG TCTATTTACA CTGGCATCAC TTTAGCTGAG TATTATCGTG 1000 

ATATGGGCAA GCACATTGCT ATGATGGCCG ACTCTACCTC TCGATGGGCT 1050 

GAGGCTCTCC GTGAGATCTC TGGGCGTCTC GCTGAAATGC CCGCTGATGG 1100 

TGGTTACCCT GCGTACCTCA GTGCGCGTCT TGCTTCCTTC TACGAGCGTG 115 0 

CGGGGCGCGT GACATGCATC GGTGGG CCAA AACGCGAGGG CTCAGTAACC 1200 

25 ATCGTTGGTG CCGTTTCTCC TCCTGGAGGT GACTTTTCTG ACC CAGTGAC 12 50 

GTCCGCTACG CTTGGTATTG TGCAAGTCTT TTGGGGTCTT GAGAAGCGTC 13 00 

TTGCGCAACG TAAACACTTT CCTTCTGTTA ATTGGCTCAT TTCCTATTCA 13 50 

AAATACCTTA ATGCTTTGGA GCCCTTCTTC AACACGCTTG ACCCTGACTA 14 00 

CATGCGCCTG CGGTCAGTTG CTGCGGAGAT CCTTCAGCGT GAGGAAGAGT 1450 

30 TGCAAGAAAT TGTTCAACTT GTCGGTAAGG ACTCACTTTC GGAGTCTGAC 1500 

AAAATTATTC TAGAAACGGC TAAGGTTATT ' CGTGAAGAGT TTCTCCAGCA 1550 

GAATGCCTTT ACGCCGTACG ACAAGTATTG CCCGCCGTAC AAGACCTGCT 160 0 

GGATGCTACG TAACATTGTC GCGTTCTACG AGGAGAGCCA GCGCGTTGTA 1650 

GCTGAGTCCG CTGGGGAACT TAAGATTACG TGGAACTACA TTCGTGAAAT 1700 

35 GATTCCTCAT ATTTACAC GG GTTTAACTGA GATGAAGTTC CGTGATCCTC 1750 

AGGAGGGTGA GGAGGC CAAC GTAGAATTCT ACAGAAAACA AAATGAGGAA 18 00 

ATTGTCAGCG CATTCGCCTC GCTGCTGCAA TAA 1833 



40 



50 



55 



2) INFORMATION FOR SEQ ID NO: 693 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 58 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Thermus thermophilics 

(B) STRAIN: HB8 

(C) ACCESSION NUMBER: D63799 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 693 



ATGATCCAAG GGGTGATCCA GAAGATCGCG GGCCCGGCGG TGATCGCCAA 50 
GGGCATGCTC GGGGCCCGCA TGTACGACAT CTGCAAGGTG GGCGAAGAGG 100 
60 GCCTCGTGGG CGAGATCATC CGCCTGGACG GGGACACGGC CTTCGTCCAG 150 
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^^^nn arAPCTCGGG CCTAAAGGTG GGGGAGCCCG TGGTCTCCAC 
GGGCCTTCCC SSSwCCC CGGGATGCTG AACGGCATCT 

SrSSoCAT CCAGCGCCCC CTGGAGCGCA TCCGGGAGAA GACGGGGATC 

?aS?cac?c Sggcgtggt ggtccacgcc ctggaccggg agaagaagtg 
GGCCTGGACG cccatggtca agcccgggga cgaggtgcgg gggggtatgg 
??c?gggcac ggtgcccgag ttcggcttca cccacaagat cctggtaccc 
?cgga?gtcc ggggccgggt caaggaggtg aagcccgccg gggagtacac 
St^gagSg ccggtggtgg tcctcgagga cggcaccgag ctcaagatgt 

S?2S2cTG GCCCGTTCGC CGGGCGAGGC CCGTGCAAAG GAAGCTTGAC 
cSSacaccc ccttcctcac GGGGATGCGC ATCCTGGACG tcctcttccc 
cg5SScca?§ ggSgcaccg ccgccatccc tgggcccttc ggcagcggca 

SaIStgAC CCAGCAGTCC CTGGCCAAGT GGTCCAACGC CGACGTGGTG 

S?c5acg?Sg otgcgggga gcgggggaac gagatgaccg acgtgctcgt 

rrll^CCCC GAGCTCACCG ACCCCAAGAC GGGTGGGCCC TTGATGCACC 
SScSorccS catcgccaac ACCTCCAACA TGCCCGTGGC CGCCCGCGAG 
gSagStct acgtgggcgt gaccatcgcc gagtacttcc gcgaccaggg 

r?5c?G?GTG GCCCTCATGG CCGACTCCAC GAGCCGCTGG GCCGAGGCTT 
SSSSSS CTCTAGCCGC CTCGAGGAGA TGCCCGCCGA GGAGGGCTAC 
^rr^Zr TCGCCGCCAG GCTCGCCGCC TTCTACGAGC GGGCGGGCAA 

Sg?ca?cacc ??gSgSggcg aggagggggc ggtgaccatc gtgggggccg 
?SccccScc gggcggcgac atgtccgagc ccgtgaccca gtccaccttg 

ISarrrTCC GGGCCTTCTG GCGGCTTGAC GCCTCCCTGG CCTTCCGCCG 
5c2™CCC GCcS5cAACT GGAACGGCTC CTACAGCCTC TTCACCTCCG 
C^TTOACCC CTGGTACCGG GAGAACGTGG CCGAGGACTA CCCCGAGCTC 
cgSIcgcS TCTCCGAGCT TTTGCAGCGG GAGGCGGGCC TCCAGGAGAT 
COTCCAGCTC GTGGGGCCGG ACGCCCTCCA GGACGC CGAG CGCCTCGTCA 

??gaggtSgg ccggatcatc cgcgaggact tcctgcagca GAACGCCTAC 
£a?gSgg?gg ACGCCTACTG CTCCATGAAG aaggcctacg ggatcatgaa 

?A?GA?CC?C GOTTCTACA AGGAGGCGGA GGCGGCCATC AAGCGGGGGG 

???cc55Sga cgagatcctg cagctccccg ttctggagcg catcggccgc 
gcccS?acg tgagcgagga ggagttcccc gcctactttg aggaggccat 
gaaggSStc cagggggcct tcaaggctgg cctaaagggg gagagatgga 



ccttctga 



2) INFORMATION FOR SEQ ID NO: 694 

(i) SEQUENCE CHARACTERISTICS: 
i0 (A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 694 
CGGCGCIATC YTSGTTGTTG C 

50 



2) INFORMATION FOR SEQ ID NO : 695 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 695 
GTTTCACGTG ATGACGTACA 



2) INFORMATION FOR SEQ ID NO: 696 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 696 
ATIGGICAYR T I GAYCAYGG IAARAC 



2) INFORMATION FOR SEQ ID NO: 697 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 697 
CCIACIGTIC KICCRCCYTC RCG 



2) INFORMATION FOR SEQ ID NO: 698 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1185 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: extracted from J01690 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 698 

GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTTA ACGTTGGTAC 
TATCGGCCAC GTTGACCACG GTAAAACTAC TCTGACCGCT GCAATCACCA 
CCGTACTGGC TAAAACC TAC GGCGGTGCTG CTCGTGCATT CGACCAGATC 
GATAACGCGC CGGAAGAAAA AGCT CGTGGT ATCACCATCA ACACTTCTCA 
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CGTTGAATAC GACACCCCGA CCCGTCACTA CGCACACGTA ggGCCgQ 
GGCACGCCGA CTATGTTAAA AACATGATCA CO0B1W. CGCAGACTCG 
GGCGCGATCC TGGTAGTTGC TGCGACTGAC GGCCUoAi ATCATCGTGT 
TGAGCACATC CTGCTGGGTC GTCAGGTAGG GGAACTGGTT 
5 TCCTGAACAA ATGCGACATG GTTGATWV^ TACGACTTCC CGGGCGACGA 
GAAATGGAAG TTCGTGAACT JCTGTCTCAG qgcgacGCAG 
CACTCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCiO ^^^^ 
AGTGGGAAGC GAAAATCCTG GAACTGGCTG GCTTCCl^ CGATCGAAGA 
CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCT CGTGTAGAAC 
10 CGTATTCTCC ATCTCCGGTC GTGGTACCGT TATCAAAGAG 
G CGGT AT CAT CAAAGTTGGT GAAGAAGTTG AAATCGTT ^ =TQCTOGft 
ACT CAGAAGT CTACCTGTAC TGGCGTTGAA f f GGTATCAAAC 
CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT JCTGCi CACCATCAAG 
GTGAAGAAAT CGAACGTGGT CAGGTACTGG JTAAG msmbmbq 
15 CCGCACACCA AGTTCGAATC TGAAGTGTAC ATT ^ TTCTACTTCC 

SS SSKSS sgjggs SKS33 

SSSSSS2 SS5SSS II^AGGC GGCCGTACCG 

TTGGCGCGGG CGTTGTTGCT AAAGTTCTGG GCTAA 



20 



25 



30 



35 



2) INFORMATION FOR SEQ ID NO: 699 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 699 
GTIACIGGYT CYTYRARRTT ICCICC 



26 



40 2) INFORMATION FOR SEQ ID NO: 700 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
45 (c) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 700 

24 

TIRTIGAYGT CGARTTCCCT CARG 



2) INFORMATION FOR SEQ ID NO: 701 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
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10 



(C) S TRANDEDNE S S : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 701 
GTGTTCACGA TCATCGATGC G 21 

2) INFORMATION FOR SEQ ID NO: 7 02 



(i) SEQUENCE CHARACTERISTICS : 
15 (A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 

2 0 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 02 
CTCTCGATAT CCGCGAAGCG 2 0 

25 

2) INFORMATION FOR SEQ ID NO: 703 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 703 

4 0 TATGGAAATT CGAAACATCT 2 0 



45 



2) INFORMATION FOR SEQ ID NO: 704 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 04 

55 

AGTGCTCCAA TTAATGTTGG 2 0 



60 2) INFORMATION FOR SEQ ID NO: 705 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 705 
GTACAGTTCC AATACCTGAA 



2) INFORMATION FOR SEQ ID NO: 706 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 706 
TGAAATCTTC ACATCCAACA 



2) INFORMATION FOR SEQ ID NO: 7 07 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 707 
TWACCATTTC AGTACCTTCT GGTAA 



2) INFORMATION FOR SEQ ID NO: 708 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 708 
TCRTCCATIC CIARIATIGC IATIAT 
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2) INFORMATION FOR SEQ ID NO: 709 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1656 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Borrelia burgdorferi 

(C) ACCESSION NUMBER: extracted from AE00783 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 709 

2 0 ATGAATGAAG TTTTATTTGT AAAGACTGCT GGTAGGAATT TAAAAGCAGA 50 

AGTAATTCGT ATTAGGGGCA ATGAAGTTGA TGCACAGGTT TTTGAATTGA 10 0 

CAAAAGGGAT ATCTGTTGGA GACCTAGTTG AATTTACAGA CAAACTTTTA 150 

ACAGTTGAAC TCGGACCAGG GCTTTTAACT CAAGTATATG ATGGGCTTCA 200 

AAATCCTTTG CCTGAATTGG CTATTCAATG TGGATTTTTT TTAGAAAGGG 250 

25 GAGTATATTT AAGGCCCTTG AATAAAGATA AAAAGTGGAA TTTTAAAAAA 3 00 

ACCTCCAAAG TTGGAGATAT CGTTATTGCA GGAGATTTTT TAGGTTTTGT 3 50 

AATTGAGGGA ACTGTTCACC ATCAAATAAT GATTCCATTT TATAAAAGGG 400 

ATTCTTATAA AATTGTGGAG ATTGTAAGTG ATGG CGACTA TTCGATTGAT 4 50 

GAGCAAATTG CTGTAATTGA AGATGATTCT GGTATGAGGC ATAATATTAC 500 

3 0 AATGTCTTTT CATTGGCCTG TTAAAGTTC C TATTACTAAT TATAAGGAAC 550 

GCCTTATTCC TAGTGAACCT ATGTTGACTC AAACTAGAAT TATAGATACA 60 0 

TTTTTCCCAG TTGCCAAAGG TGGAACTTTT TGCATTCCGG GTCCTTTTGG 650 

AGCAGGAAAA ACGGTTCTTC AGCAGGTTAC AAGTCGAAAT GCTGATGTTG 70 0 

ATGTAGTGAT TATTGCAGCT TGTGGTGAGC GAGCAGGAGA AGTGGTAGAA 75 0 

3 5 ACTCTTAAAG AATTTCCCGA ATTAATGGAT CCAAAAACCG GCAAATCTTT 800 

AATGGACAGG ACTTGTATTA TTTGTAATAC ATCTTCAATG CCAGTTGCAG 850 

CTAGAGAAGC TTCTGTTTAT ACTGCTATTA CTATTGGTGA GTATTACAGG 90 0 

CAAATGGGCC TTGATATTCT TCTTTTGGCA GATTCAACTT CAAGATGGGC 950 

TCAAGCAATG AGAGAAATGT CTGGACGCCT TGAGGAAATT CCTGGCGAGG 100 0 

4 0 AGGCTTTTCC GGCATATCTT GAGTCTGTTA TTGCTTCCTT TTATGAAAGG 1050 

GCAGGTATTG TAGTTCTTAA TAATGGGGAT ATTGGATCTG TAACAGTTGG 110 0 

TGGCTCTGTA AGTCCTGCTG GTGGTAATTT TGAAGAGCCA GTTACTCAAG 1150 

CAACTTTAAA AGTTGTAGGA GCATTTCACG GGCTTACAAG AGAAAGGTCT 12 00 

GATGCTAGGA AATTTCCAGC TATTAGTCCT CTTGAATCTT GGAGTAAATA 12 50 

4 5 TAAAGGCGTT ATTGATCAAA AAAAGACTGA ATATGCAAGA TCTTTTTTGG 13 00 

TGAAAGGTAA TGAAATTAAT CAAATGATGA AAGTTGTTGG AGAAGAAGGC 13 50 

ATAAGTAACG ATGATTTTTT AATTTATTTA AAATC CGAGC TACTTGATTC 1400 

GTGCTATTTG CAGCAAAATT CATTTGATTC TATTGATGCT GCTGTTAGTT 14 50 

CAGAGCGTCA AAATTATATG TTTGATATAG TTTATAACAT TCTTAAAACT 1500 

50 AACTTTGAGT TTTCTGATAA ACTTCAAGCA AGAGATTTTA TAAATGAGTT 1550 

AAGGCAAAAT CTTTTAGACA TGAATCTTTC TTCTTTTAAG GATCATAAGT 1600 

TTAATAAATT GGAGCATGCT TTGGGTGAAT TGATAAATTT TAAAAAGGTA 1650 

ATTTAG 1656 



55 



2) INFORMATION FOR SEQ ID NO: 710 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 1818 bases 
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10 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Treponema pallidum 

(B) STRAIN: Nichols .r^n^on 

(C) ACCESSION NUMBER: extracted from AE000520 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 710 
PTGATCAAAG ACGATGTGGT TACAGGCCGT GTAGTGAGGG TGTCTGGTCC 



' - SSSjSK? SSSaSgScC tcJSScGTC CAGCGTATAC GATGTTGTCG 100 
" SS S«C ™GAAA ™TCCGGTT GG~GC 1 = 0 

250 

20 SSSSSS SSSSKot SSSkSSot GCGCGTTCAC AACCGCTTGA 350 
TGGCTCa3TA CGCTGGGATT TTCGTCCTCA TTGTAACGAG CGCGGTGAGG 400 

500 



JSSSSES tgcaagtata cgaggatgac acaggtatgc gagtcgggga 

GAAGGTGACA AGCTTGCGTC GACCACTCTC AGTCCGCTTA GGGCCTGGAT 
TAA^GGCAC CATTTATGAC GGTATTCAGC GCCCACTTGA GCGCCTCTTC 



gScA^AGT? ACTGGACAGC GGGCGATTGA TGTTTTCTTC CCCCTATCAA 
Sgggaggaac GGCGGCTATT CCAGGGGGAT TTGGAACTGG GAAGACAATG 
a2g?agcatg ccgttgccaa GTGGTGTGAT gcagatatta tcgtgtacat 



SSSSSSS ggggattccg attgcacctg ggtcagtgtt agggaccgtg 
caggagactc cttctgttgt gcacactatc atggttcctc ctgacatccg 

S^AGCGTG CTATCTTCGT TCAAGGGCGC AGGTGCTTAC ACAATAGATG 550 

25 AA?AAA??5S ACGCACTGAT CTTGGTGAGC CGCTTTTTCT ATCCCAGTAC 600 

TGGCCAGTGC GTCGTGCGCG TCCTTTCAGC AAAAAACTTG CAGTGTGTGA 650 

750 

30 SSg^gSgA gXgCGGGGC^ ACGAGATGAC AGACGTGCTC TCTGAATTTC 850 

CCaS?TCAT CGATCCGCGC ACAGGACGCT CTCTTATGGA GCGGACGATT 900 

TTGATCGCAA ATACGTCCAA TATGCCTGTG TCCGCACGCG AGGTGTCGCT 950 

g?a??caSgg ATTACCCTTG CGGAATACTA CCGTGATATG GGTATGCATG 1000 

??ScATCAT GGCTGATTCT ACCAGCCGCT GGGCGGAGGC GCTGCGTGAA 1050 

35 SSSSS? gcatggaaga AATGCCTGCG GAGGAGGGAT TCCCTGCGTA 1100 

SScCGACG CGTCTTGCAG AATTTTATGA GCGCGCAGGA CGCGTGGAAA 1150 

CCTC?G?GGC GCGCGAGGGC TCTGTGAGCA TCATTGGTGC TGTTTCTCCC 1200 

SgGGTGGAG ATTTCTCTGA GCCGGTGACG CAGCACACAA AGCGCTTCAT 1250 

CCOTGOT? TGGGC CTTGG ATCGTGAACT TGCACACGCG CGTCATTACC 13 00 

40 C 5gCCa5tGG GTGGATAGAT TCATACTCTG AATATGCGCA GGAAGTAAGT 1350 

C CATGGTGGA GTAAGTATGA CCCGCGCGCA GGCGCGTTGC GCGCCGCAGC 1400 

SSSSSSS CTGAGAAAGG AACAGCGGTT ACAGCAAATT GTCAGGCTTG 1450 

TCGGTCCTGA TGCGCTGCCT GGAGAAGATC GTCTGGTGCT AATGGTGTGT 1500 

GAAATGATCA AAGGTGGCTT TCTGCAGCAG AACGCTTTTG ATCCGACGGA 1550 

45 S£ot5cTCC TGTCCCGAAA AGCAGGTGCA GATCTTGCGT ACCATAGTGG 1600 

45 a?Sca?GA ACGTGCCGTG GTGCTGCTGC GTGCAGGTAT TTCGCTTTCT 1650 

G^GCTGTCCC AGCTTTCGTG CCGGGAGCTC ATCGTACGTA TGAAAACTAC 1700 

GTACGGGAAT GAGGATGTAC ACAAGATGCA GAAAGTGTAC GACACGATGT 1750 

G^CTGAGTT TGACCAACTG AGTGTGTGTG CTGCCGCGCG CACACAAGGG 1800 
50 GGGGAGAAAG TCGAATGA 



2 ) INFORMATION FOR SEQ ID NO: 711 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1779 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Chlamydia trachomatis 

(B) STRAIN: MoPn 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 711 

10 CAGGGCTATG TCGTAGAAGC TTAC GGAAAT TTATTG CGGG TGCATTTTGA 
TGGGCATGTG CGTCAAGGAG AAGTGGCCTA TGTCAGCGTG GATGATACTT 
GGTTGAAAGC GGAAATTATA GAAGTTGTGG GAGATGAGGT TAAAGTCCAA 
GTTTTTGAGG AAACTCAAGG AATTTCTCGA GGCGCTTTGG TAACTTTTTC 
CGGGCATTTA TTAGAAGCGG AACTTGGGCC CGGTCTATTG CAAGGTATTT 

15 TTGACGGACT TCAGAATCGC TTAGAGGTAT TGGCAGATAC AAGCTTGTTC 
TTGAAAAGAG GGGAGTATGT TAATGCCATT TGTCGGGAAA CTGTATGGGC 
TTATAC GCAA AAGGCTTCTG TCGGGGATGT TCTATCTCGG GGAGATGTGC 
TTGGTACAGT AAAGGAAGGG CGGTTTGATC ATAAAATCAT GGTTCCTTTC 
TCTTGTTTTG AGGAAGTGAC TATCACTTGG GTCATTTCTT CAGGAGATTA 

2 0 CACTGTTGAT ACCGTTATTG CTAAAGGACG TACTGCTTCA GGAGCCGAGC 
TTGAATTTAC AATGGTTCAG AAATGGCCCA TTAAACAGGC TTTTTTAGAA 
GGGGAAAAGG TACCGTCTCA TGAAATTATG GATGTTGGGT TACGAGTATT 
AGATAC TCAG ATCCCCGTCT TAAAGGGAGG AACTTTTTGT ACTCCAGGGC 
CTTTTGGTGC AGGAAAGACC GTTTTACAGC ACCATTTATC TAAGTATGCA 

2 5 GCTGTAGATA TCGTAGTTTT GTGTGCTTGT GGAGAGCGAG CTGGAGAGGT 

TGTAGAAATT CTTCAGGAGT TCCCGCATTT GACAGATCCT CATACGGGGC 
AGTCTTTGAT GCATAGGACC TGTATTATTT GTAATACATC TTCCATGCCT 
GTAGCAGCTA GAGAGTCCTC CATTTATTTG GGTATTACTA TAGCAGAATA 
TTACCGTCAA ATGGGGTTGC ATGTTTTGTT ATTGGCTGAC TCGACATCTA 
30 GATGGGCTCA AGCTTTAAGG GAAATTT GAG GGCGATTAGA AGAAATCCCT 
GGAGAAGAAG CTTTCCCAGC CTATTTGGCG T CTCGAATAG CAGCTTTTTA 
TGAGCGAGGC GGGGCTGTGA AAATGAAAGA TGGATCGGAA GGCTCCTTGA 
CTATCTGTGG AGCGGTTTCT CCCGCAGGAG GAAATTTTGA AGAGCCTGTT 
ACACAAGCAA CTTTATCTGT TGTTGGGGCT TTCTGTGGGC TTTCTAAGGC 

3 5 TAGAGCAGAT GCTAGACGGT ATCCTTCTAT TGATCCGATG ATTTCATGGT 

CTAAGTACTT GGATTCTGTG GCGGAGATTT TAGAGAAAAA AGTTCCAGGA 
TGGGGAGATT CCGTTAAAAA AGCTTCTCGT TTCTTAGAAG AAGGAGCAGA 
AATTGGTAAG CGAATAGAAG TTGTTGGGGA AGAAGGGATT TCTATGGAAG 
ATATAGAAAT CTTTTTGAAA TCAGAGTTGT ATGATTTCTG TTACTTACAG 

4 0 CAAAACGCTT TCGATGCAGA GGACTGTTAT TGTCCTTTTG ATCGTCAAAT 

AGAGCTTTTT TCTTTAATGA GTCATATTTT TAGCTCTAGA TTCTGTTTTG 
ATTGTCCAGA TAATGCTCGG AGTTTCTTTT TAGAGCTTCA AAGTAAAATT 
AAAACGCTGA ATGGTCAAAA ATTC CTTTCT GAAGACTATC AGAAGGGGCT 
AGAAGTGATC TATAAACTAT TAGAAAGCAA AATGGTGCAG ACGGCGTAGG 
4 5 TATGCAAACA ATATATACAA GAATTACGG 



2 ) INFORMATION FOR SEQ ID NO: 712 



50 
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50 



55 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 965 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) 



MOLECULE TYPE: Genomic DNA 



(vi) 



ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 



60 
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(B) STRAIN: V583 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 712 

SESSS SSSSES SSS SSSSSSS SSS^ 
™ ISSESS SEEKS E5SS3S 

SSIgJS^ ttatctgttg aattagcccc aggtttaatt gccgaaatgt 
?S£SSSt tcaacgacca ttggatacat ttcaagaagt aacccacagt 

SrTTTTTAG GCCGTGGCGT TAAAATTGAT GCGTTAGATC GTGAGAAAAA 
ISSJcgttt gSccaactg TGGCAGTTGG tgaagaagtg TCGGCAGGTG 
aSt£g?cg£ TGTGGTTCAA gaaacaccga ttattcaaca taaaattatg 
?S?ct?tcg gcgtttcagg aacgattgcc gaaattaaag caggtgactt 
gaaacagttt actcagtgga aacggctaaa ggaacggaaa 
tcgcccgttc ggcggggacg tcccatttta 
gaaaaactaa gtcccaaagt accgatggtg accggacaac gcgtaattga 

?A^??TC CCAATTACGA AAGGCGGAGC GGCAGCAGTT CCAGGACCAT 
TTGGCGCTGG AAAAACAGTC GTTCAGCACC AAATTGCTAA GTGGGCCGAT 
GTCGACTTAG TCGTTTACGT TGGTTGTGGG GAACGCGGGA ATGAAATGAC 

SJtoStta aatgaatttc cagaattaat tgacccaaca ACTGGTGAGT 
ot?£?gaa" togacgatt ttaattgcga atacgtcaaa tatgccggta 
SSSScSS aaIcctcgat ttatacaggg attaccattg cagaatattt 



CCGTGATATG GGTTA 



25 



2) INFORMATION FOR SEQ ID NO: 713 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1737 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3b (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Methanosarcina barJcen 
40 (c) ACCESSION NUMBER: J04836 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 713 

r TGGAAGT AA AAGGTGAAAT TTATCGTGTG TCTGGGCCTG TCGTCACCGC 
a* SSSScSS CAGGCAAAAA TGTATGACCT GGTCAAAGTC GGTAATGAAG 100 
45 g???SS5g SJaGTCATT CAGATATTAG GGCCCAAGAC CATCATCCAG 150 



GTATATGAAG AGACCGCAGG TATCAAGCCTA GGGGAACCCT GTGTATCTAC 
ACGGTCGTCT CTGTCCGTAG AACTTGGTCC GGGTCTTCTT TCCAGTATTT 
ATGACGGGGT TCAAAGGCCT CTGCACGTCC TGCTTGAAAA AATGGGTAGC 
so ??2a?ccaga gaggtgtcag CGCAGATGGG CTTGATCATA AGAAACTCTG 
SStttcaaa CCCATTGTCA AGAAGGGCGA TTCCGTAAAA GGTGGAGACG 
TAATTGGTGT TGTACAGGAA AC CGTGAATA TTGAACATAA GATCATGGTG 
CCTCCTGATA TCTCAGGTAC AATTTCCGAC ATAAAGAGCG GAAACTTTAC 
SSotagac ACAATCTGTA CTCTGACTGA TGGGACCGAA TTGCAGATGA 

55 tcSgaggtS gcctgttcga agacccagac ctgtgaaggc aaaacttact 
Saaccaggc ctctggttac aggaatgaga atccttgatg ggcttttccc 
?g^S5aa ggcggaacag ctgcaatccc cggacctttc ggatcgggaa 
Igaccgtaac tcagcagtcg cttgcaaaat ggagtgatac cgaaattgtg 

OTCTaStCG GTTGTGGTGA GCGTGGAAAC GAAATGGCAG ATGTTCTGAG 

60 SSttccct gaactcgaag atccgcagac cgggcgccca cttatggagc 



50 
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GTACTGTTCT TATCGCTAAC ACTTCAAACA TGCCTGTGGC CGCAAGAGAA 900 

GCATCTGTGT ATACCGGAAT CACCATTGCA GAATACT AC C GTGACATGGG 950 

ATTAGATGTA TCCCTTATGG CAGACTCCAC CTCAAGGTGG GCAGAAGCCA 1000 

TGAGAGAAAT CTCTTCCCGT CTGGAAGAAA TGCCTGGTGA AGAAGGTTAC 105 0 

5 CCAGCATACC TGTCTGCAAG ACTGGCCGAA TTCTACGAGC GTGCCGGGGT 110 0 

TGCGGAGAGT CTTTGCGGCG AAACAGGTTC CATTACTGTT ATTGGAGCAG 1150 

TATCTCCACC TGGCGGTGAC TTCTCAGAGC CTGTTACACA GAATACCCTG 12 0 0 

CGTATCGTAA AAGTGTTCTG GGCTCTCGAT GCCAAACTAT CTCAGAGGCG 1250 

TCACTTCCCG GCCATCAACT GGCTGAACAG TTACAGTCTG TATAAGGACA 13 0 0 

10 GTCTTAATGA CTGGTTTGCA GATAATGTGG CTCCTGATTA TGTGCCTTTG 1350 

AGGGAAAGAG CAATGGAAAT GCTCCAGACA GAATCTGAAC TGCAGGAAAT 14 0 0 

CGTGCAGCTT GTAGGTTCCG ATGCTCTGCC AGACGACCAG CAGCTTCTGC 14 5 0 

TTGAAATCAC CCGTATGCTT AGGGAAATTT TCCTGCAGCA GAATGCATTC 150 0 

CACCCAGTAG ATGCATACAG CCCGTTCGAT CAGCAGTACA AGATC CTTAA 155 0 

15 GGCAATCATG AAATGGGGAG ACGCTGCGAT GGATGCCTTG AAATCAGGTG 1600 

TTCCCGTAAC TGAAATTATC AAGCTTGAAT CCAAAAATGT GCTTGCTAAG 1650 

GTCAAGTACG AAGAGAAGTT TGATGAGTCT ATGAATGCTG TCCTGGCACA 17 00 

GATGGATAAA GAGTTTGCAT CCCTGAGAGG TAGGTAA 173 7 



20 



35 



2) INFORMATION FOR SEQ ID NO: 714 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 1785 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

3 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Methanosarcina jannaschii 
(C) ACCESSION NUMBER: U67477 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 714 



ATGAAAAGAG AGGTTGAGAA TATGCCAGTT GTTGGTAAGA TTATTAAAAT 50 

CGCAGGGCCT GTTGTAGTTG CAGAGGGAAT GAAAGGAGCT CAGATGTATG 100 

4 0 AGGTCGTTAA AGTAGGAGAA GAGAAATTGA CTGGAGAAAT CATTCAGTTG 150 

CACGATGATA AAGCAGTTAT TCAGGTTTAT GAAGAAACAT CTGGAATTAA 2 00 

ACCAGGAGAG CCAGTTGTTG GTACTGGAGC TCCATTGTCT GTTGAATTAG 2 50 

GGCCAGGGAT GTTAAGAGCT ATGTATGATG GTATT CAGAG GCCTTTAACA 3 00 

GCAATTGAAG AGAAAACAGG TTCAATCTTT ATCC CAAGAG GAGTTGATGT 3 50 

45 CCCTGCATTA CCAAGAGATA TAAAATGGGA ATTTAAACCA GTGGTAAATG 4 00 

AAGGAGATTA TGTTGAAGAA GGAGACATAA TTGGAACTGT TGATGAAACT 4 50 

CCTTCAATAG TTCATAAAAT CTTAGTTCCA ATTGGTGTTA AAGGAAAAAT 500 

TGTTGAAATA AAAGAGGGTA AATTTACAGT TGAAGAGACA GTTGCAGTTG 550 

TAGAAACAGA AAATGGAGAA AGGAAAGAAA TTACAATGAT GCAAAAATGG 600 

50 CCAGTAAGAA AACCAAGACC ATATAAAGAG AAACTACCTC CAGAAATTCC 650 

ATTAATTACA GGGCAAAGAG TTGAAGACAC TTTCTTTACA TTAGCAAAAG 700 

GAGGAACAGC AGCAATTCCA GGTCCATTCG GTTCAGGAAA AACGGTTACT 75 0 

CAGCATCAGT TGGCAAAGTG GTCTGACGCT GATGTCGTTG TTTATATCGG 80 0 

ATGTGGAGAA AGAGGAAACG AGATGACAGA GGTTATTGAA GAGTTC C C AC 850 

55 ACTTAGAAGA TATTAGAACT GGAAACAAAT TAATGGATAG AACTGTATTA 900 

ATAGC CAACA CATCAAACAT GCCTGTCGCT GCAAGGGAAG CATCTGTCTA 950 

TACAGGAATT ACAATTGCAG AGTACTTCAG AGATATGGGT TATGGAGTTT 1000 

TATTAACAGC AGATTCAACA TCAAGATGGG CAGAGGCAAT GAGAGAAATT 105 0 

TCAGGTAGAT TGGAAGAAAT GCCAGGGGAA GAAGGGTATC CAGCATACTT 1100 

60 AGCTTCAAGA TTGGCTCAGT TCTATGAAAG AGCTGGAAGA GTTATAACCT 1150 
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tAACAGACAA GGATTCGTTT GTATCGTTGG AGCTGTTTCA 1200 

l^^^tr SSSScTC AGAACCAGTT ACATCAAACA CACTAAGGAT 12 50 

AOT?Sgg5a ??c?GGgS5 tScAAA CTTGGCAAGA AGAAGACACT 1300 

TCCCAOTAT CAACTGGTTG CAGAGTTATT CATTATACAT TGATGATGTT 1350 

_ TSSSnSrrT GGAACACAAA TACTGGTCCA GATTGGAGAC AATTAAGAGA 14 00 

5 Sg^StTAC AAAAAGAGGC AGAGTTGCAA GAGATTGTTC 1450 

T^J^P ScSStGCA TTGCCAGATA GGGAGAGAGT TATTTTAGAA 1500 

TGTTGAGGGA GGATTTCTTA CAGCAAGATG CGTTTGATGA 1550 

SgSSaTACC TACTGTCCTC CAATGAAACA GTACTTAATG TTAAAGATAA 1600 

10 ?5ItGaStT CTACCAAGAA GCATTGAAGG CAGTTGAAAG AGGAGTTGAA 1650 

JJSgctSgA TTTTAGGAGT TTCAGTTAAG CAAGATATTG CAAGAATGAA 1700 

SaCA^C^CA ScGATGAGT TTATAAATGT TAAATCAAAA GAAATAATGG 1750 

AGAAAATTAA GAATGAATTA GGTTCATTAA ACTAA 1785 

15 

2) INFORMATION FOR SEQ ID NO: 715 

(i) SEQUENCE CHARACTERISTICS: 
2 0 (A) LENGTH: 1354 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: . . 

(A) ORGANISM: Porphyromonas gmgivaxis 

(B) STRAIN: W83 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 715 



30 



50 
100 

35 TATGCTTGAG ^T^^i^ - — ^'rTTrCT :AAACGAGGC 200 

250 
300 
350 

40 AATTACAAGG T™^^ T^™^ ^lairrif AATGTAACCA til 

500 
550 
600 

R prrrAAAGAC GGTGCTCCAG CA?GCTAt8t CGAAGCAGGC GGAAGCCGAT 650 
45 ATCgS SSSSS ™-GT GCAAACGAGG TTGTGGAGAT 700 

800 
850 
900 
950 
1000 
1050 



TGAGC GAGGT GAT C AAGGTG ATCGGCAAAA ATGCTTATGT GCAGGTTTTC 
^Spt£?tc GCGGTATGCA CGTAGGAGAT GAGGCAGAGT TTACCGGCAG 
SSS2SSE ctaSgctcS GCCCCGGTAT gctttcgaag aactacgacg 

GTCTGCAACA CGACTTGGAC AAGATGGACG GGAT CTTCCT CAAACGAGGC 
SSaTACTC CCGCTCTCGA TGACGACAAG CTGTGGGACT T C AAGCCTTT 
GGCCAATGTG AACGACAATG TGATCGCAGG CTCATGGCTC GGAGAGGTGA 
cSS5a5??t ccaaccgcac AAGATCATGG tacctttcgt TTTCGAAGGC 
Sttacaagg TGAAGAGTCT GGCCAAAGCC GGTTCGTACA AAGTGAACGA 
TGTGATCGCT GTGGTAACGG ATCAGGACGG GAAAGACCAC AATGTAACCA 
?gg?S?Saa ATGGCCGGTG AAACGTGCTA TCACTTGCTA TCGCGAGAAG 
IrrrcTCC^ TCAAACTGCT CGAAACGGGT ATCCGTATCA TCGACACTTT 

SSSSSS gSg^ggItS gtacgggatt tatccccggt cctttcggta 



ctttgcggaa ttcc^ccacc tgaatgaccc ccacacggga cgtaaattga 
SgIacgtac cattattatt gctaatacgt cgaatatgcc tgtggcttcg 
StgISScat ccgtatatac ggccatgacg atagccgagt actatcgctc 

r??SS?CTT CGCGTGCTGA TGATGGCAGA CTCCACTTCG CGTTGGGCAC 

50 SSSSSS TGAGATGTCT aaccgtctgg aagagcttcc cggaccggat 

rrTTTCCCGA TGGACTTGTC AGCTATCGTA GCCAACTTCT ACGCTCGTGC 
SSSaCOT? TACCTGAACA ACGGTTCGGC CGGTTCGGTA ACGTTCATCG 

otSSSSc Scc52cggt GGTAACCTCA AAGAGCCTGT GACGGAAAAC 1100 

c * aPCARGAAAG TGGCTCGCTG CTTCTATGCT TTGGAGCAGA ATCGTGCCGA 1150 

55 c§^a£c£? ta?ccggctg TAAACCCCAT CGATAGTTAC TCGAAGTACA 1200 

?CGAA^TCC CGAATTCGAG AGCTATATAT CGAACCACAT CAGTTTACTC 1250 

JtTATATTCA ACAGAAGTTG GCAGATATAT GGATCAAATC TTACAGCAGA 1300 

CTGGACTACT AAGGTGAATG AGCTGAAGAT GCGCTTGCAT CAGGGTAAAG 1350 
60 AAAT 
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2) INFORMATION FOR SEQ ID NO: 716 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: Type 4 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 716 

20 TTTGACTCAA GGGAAGATTA TAAAAGTATC GGGACCTCTA GTTATTGCAT 50 

CAGGTATGCA GGAGGCTAAT ATTCAAGATA TTTGCCGTGT AGGTAAGCTA 100 

GGGTTAATCG GTGAAATTAT TGAAATGAGA AGAGATCAGG CATCTATCCA 150 

AGTCTATGAA GAAACATCTG GTCTTGGTCC GGGAGAACCT GTTGTTACAA 2 00 

CTGGAGAACC TCTCTCGGTT GAATTAGGGC CAGGATTGAT TTCTCAAATG 250 

25 TTTGATGGCA TACAACGCCC ATTAGATCGA TTTAAATTGG CTACTCATAA 3 00 

TGATTTTCTA GTTCGTGGGG TAGAAGTTCC AAGTTTGGAT AGAGATATTA 3 50 

AGTGGCATTT TGATTCCACT ATAGCAATTG GTCAAAAAGT GAGTACGGGT 4 00 

GATATTC TTG GAACTGTCAA GGAAACCGAG GTAGTTAATC ATAAAATTAT 450 

GGTTCCTTAT GGAGTATCTG GAGAAGT CGT TTCTATTGCA TCTGGCGATT 500 

3 0 TTACAATTGA TGAAGTTGTA TATGAAATAA AAAAATTGGA CGGTAGTTTC 550 

TATAAAGGAA CGCTTATGCA AAAATGGCCT GTCCGCAAGG CGCGTCCTGT 600 

TTCTAAACGT TTAATTCCAG AAGAACCATT AATCACAGGT CAACGAGTTA 650 

TTGATGCATT CTTTCCAGTA ACCAAAGGGG GAGCTGCAGC AGTTCCTGGA 700 

CCGTTTGGAG CAGGAAAGAC AGTTGTACAA CACCAAGTAG CTAAATTTGC 750 

35 CAATGTTGAT ATTGTTATTT ATGTCGGTTG TGGAGAACGT GGAAATGAAA 800 

TGACGGATGT ACTGAATGAG TTTC CTGAGT TGATTGACCC TAATACCGGA 850 

CAATCAATTA TGCAACGGAC AGTTCTGATT GCTAATACTT CAAATATGCC 900 

TGTTGCTGCT CGTGAGGCTT CAATTTATAC AGGAATTACC ATGGCTGAGT 950 

ATTTTCGTGA TATGGGC T AC TCTGTCGCCA TTATGGCTGA TTCAACTTCA 1000 

4 0 CGTTGGGCAG AAGCGCTACG TGAAATGTCA GGACGTCTAG AAGAAATGCC 10 50 

TGGTGATGAG GGTTATCCTG CTTATCTGGG AAGTCGTATC GCTGAATATT 110 0 

ATGAAAGAGC AGGACGTTCT CAGGTTCTAG GGCTTCCAGA ACGTGAAGGA 1150 

ACGATTACTG CTATTGGAGC TGTATCGCCA CCTGGTGGAG ATATTTCAGA 12 00 

AC CAGTTACT CAAAACACTT TACGGATTGT GAAAGTTTTT TGGGGGCTTG 12 50 

4 5 ATGCTCCGTT GGCACAGCGA CGTCATTTTC CTGCAATTAA CTGGC TTACA 13 00 

TCTTATTCAC TATATAAAGA CAGTGTGGGC ACTTATATAG ATGGTAAAGA 13 50 

GAAGACAGAT TGGAATAGTA AAATAAC TCG TGC GATGAAC TACTTACAAC 14 00 

GGGAATCTAG TTTAGAGGAA ATTGTTCGTC TTGTTGGAAT TGATTCTCTG 145 0 

TCTGATAATG AACGACTAAC GATGGAAATT GCTAAACAAA TTCGAGAAGA 15 0 0 

5 0 TTATTTGCAA CAGAACGCTT TTGATTCGGT AGATACATTC ACTTCGTTTG 155 0 

CAAAACAAGA AGCAATGCTA AGTAATATTC TCACTTTTGC TGATCAGGCA 1600 

AATCATGCTT TAGAGTTGGG TTCTTACTTT ACAGAGATTA TGGAAGGTAC 1650 

CGTGGCAGTT CGAGACCGTA TGGCGAGAAG TAAATATGTT TCAGAAGATA 17 00 

GATTAGATGA AATCAAAATT ATATCAAATG AGA TTACA CA TCAAATTCAT 1750 

55 TTGATATTAG AAACAGGAGG TCTATAAATG AGTGTTAT 17 8 8 
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2) INFORMATION FOR SEQ ID NO: 717 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: . 
(A) ORGANISM: Burkholdena mallei 



10 <A) 

(B) STRAIN: GB8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 717 

15 C G A T C C T G G T G T G C T C G t i U u rrHTACATCA TCGTGTTCCT 100 

150 
200 
250 

20 GCTGGGCGAA 3TGGC«ATU« — -^CTGAT GCCGGTGGA7* 3sS 

400 
450 
500 
550 

25 GACCAGGGTC ^GGC^ ^^rA nnm^GCCG GGTTCGATCA 600 

650 
700 
750 
800 

30 TGGTGATGCC GGGCGALLftAU ^i^^ — 82 3 

^ ATCGCGATGG AAGAAGGTCT GCG 



SBBBS 

CCG ^CATCA AGGGTTCGGC CGACGCGCTG GACACGTACA 

1 Hi illl = sssss ssm 
Eiil gss Sii siiE 

^tllligSsSsSS 
liil ssssss sssss ssss 



35 2) INFORMATION FOR SEQ ID NO: 718 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 

4 5 (vi) ORIGINAL SOURCE: ^„^n^-t 

(A) ORGANISM: Burkholderia pseudomallei 

(B) STRAIN: 1026B 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 718 

50 ~~<~*r*r*r*r* nnnKTnrrar AAACGCGTGA 50 

100 
150 
200 

55 GCCGATCATC AAGCitirr^ ^^^n r^GACACGTAC 300 

350 
400 
450 
500 

60 ~ " ~ 



C CGATC C TGG TGTGCTCGGC CGCTGACGGC CCGATGCCGC AAACGCGTGA 
?ScCTG CTGGCGCGTC AGGTCGGTGT GCCGTACATC ATCGTGTTCC 

EE sssss ssssss ssss s gss 

SSSSS SSSSSS SSS SSSSSS ss 
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GGATCAGGGT CAGGCGGGCG 

AGCGTGAAGA CGTGGAGCGC 

ACGCCGCACA CGCACTTCAC 

AGGCGGCCGC CACACGCCGT 

5 TCCGTACGAC GGACGTGACG 

ATGGTGATGC CGGGCGACAA 

GATCGCGATG GAAGAAGGTC 



ACAACGTCGG TATCCTGCTG 
GGCCAGGTTC TGGCGAAGCC 
GGCTGAAGTG TACGTGCTGA 
TCTTCAACAA CTACCGTCCG 
GGCTCGATCG AGCTGCCGAA 
CGTGTCGATC ACGGTGAAGC 
TGCG 



CGCGGCACGA 550 

GGGTTCGATC 600 

GCAAGGACGA 65 0 

CAGTTCTACT 700 

GGACAAGGAA 750 

TGATCGCGCC 80 0 
824 



10 



2 ) INFORMATION FOR SEQ ID NO: 719 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 800 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



25 



30 



35 



40 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium beijerincki (deposited as 

Clostridium butyricum) 

(B) STRAIN: ATCC 82 60 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 719 



CATATACTAC 


50 


AAATAAAGCA 


100 


TGGAAGTAAG 


150 


CCAGTAATAA 


200 


AGAAGCAATT 


250 


TCCCAACTCC 


300 


GATGTATTCA 


350 


AGCTGGAGTA 


400 


AAGAAAAGAA 


450 


TTGGATGAAG 


500 


TCAAAGAACT 


550 


TACACCCTCA 


600 


GAAGGTGGAA 


650 


TTTCAGAACA 


700 


AAATGGTAAT 


750 


CCAATCGCAA 


800 



45 

2) INFORMATION FOR SEQ ID NO : 720 

(i) SEQUENCE CHARACTERISTICS: 
5 0 (A) LENGTH: 799 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium innocuum 

(B) STRAIN: ATCC 14501 

60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 0 



GGTGCTATCC 
TGAGC AC AT C 
5 TCCTGAACAA 
GAAATGGAAG 
CGCTCCGGTT 
AATACGTTGG 
CCAGATCCAA 

10 CGTTATGACA 
GTGGGGTAGT 
ACT C AGAAAA 
CTTCGCAGAA 
GTGAC CAGAT 

15 CCACACACAA 
TGGACGTCAC 
GTACAACTGA 
GTTATGCCTG 



TGGTTGTTGC 

CTGCTTGCTC 

ATGCGACATG 

TACGTGAGCT 

ATCCGTGGTT 

CGCTATCAAA 

CTCGTGAAAC 

ATCACAGGAC 

AAAACTGGGA 

CTGTTGTTAC 

TCCGGAGACA 

TCAGCGTGGA 

AGTTCAAGGC 

ACT C CAT TCG 

CGTAACTGGT 

GTGACAACGT 



TGCATCTGAT 

GTCAGGTAGG 

GTTGATGACG 

GTTAAGCGAG 

CTGCACTGAA 

GAACTGATGG 

TGACAAACCA 

GTGGTACAGT 

GAAGAAGTTG 

CGGACTGGAA 

ACATCGGTGC 

CAGGTTCTTG 

TCAGGTTTAT 

TTTCTAACTA 

GTTATTACAT 

TGAAATGAAC 



GGTCCTATGC 

TGTTCCTTAC 

AAGAACTGAT 

TACGGATTCG 

GGCTCTGGAA 

ATGCAGTTGA 

TTCCTGATGT 

TGCTACAGGA 

AAATCGTTGG 

ATGTTCCGTA 

TCTGCTGCGT 

CTAAACCAGG 

GTATTAACAA 

CCGTCCTCAG 

TACCGGAAGG 

GTTGAGCTGA 



CTCAGACTCG 50 

ATCGTTGTAT 10 0 

CGACCTTGTT 150 

ACGGAGATAA 2 00 

GGTGACGACA 2 50 

TGAATTCATC 3 00 

CTGTAGAAGA 3 50 

CGTGTTGAGC 4 0 0 

TATCAAGGAT 4 50 

AGCAGCTGGA 500 

GGTATCAACC 550 

ATCCGTACAT 600 

AAGAAGAAGG 650 

TTCTACTTCC 7 00 

AACTGAAATG 750 

TTGCTCCAA 7 99 



20 



25 



30 



35 



2) INFORMATION FOR SEQ ID NO: 721 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 789 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium novyi 

(B) STRAIN: ATCC 19402 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 721 



TTGTATCAGC 
TTAGCATCAA 

4 0 AGACCAAGTA 
GAGAATTATT 
GTAGGATCAG 
AGACTTAATG 
C AGATCAAC C 

4 5 AGAGGAACAG 
AGATGAAGTA 
TCACAGGAGT 
GATAAC AT CG 
AGGTCAAGTA 

50 TAGGT CAAGT 
TTCTTTAACG 
AGGATCAATC 
ATATAGACAT 



AGCAGATGGT 
GAGTAGGAGT 
GATGATC CAG 
AAGCGAATAC 
CATTAAAAGC 
AAAGCTGTAG 
ATTCTTAATG 
TTGCAACAGG 
CAAATCGTAG 
AGAAATGTTC 
GAGCATTATT 
CTAGCAAAAC 
TTACGTATTA 
GATACAGACC 
GCTTTACCAG 
GAACGTAGAA 



CCAATGCCAC 

TAAC C ACATA 

AATTACTAGA 

GGATTTGACG 

AATCGAAGAA 

ATGAATATAT 

CCTGTAGAAG 

AAGAGTTGAA 

GAATGAAAGA 

AGAAAGATGT 

AAGAGGAGTA 

CAGGTTCAGT 

AAGAAAGAAG 

ACAATTCTAC 

AAGGAGTAGA 

TTAATCACAC 



AAACAAGAGA 
GTAGTATTCT 
ATTAGTAGAA 
GAGACGAATG 
GGGGATGACC 
CCCAACTCCA 
ATGTATTTAC 
AGAGGAGTAC 
AGAAATCGGA 
TAGATGAAGC 
CAAAGAGACG 
AACACCTCAC 
AAGGTGGAAG 
TTCAGAACAA 
AATGGTAATG 
CAGTAGCAA 



ACACATCCTA 50 

TAAACAAAGC 100 

ATGGAAGTAA 150 

TCCAGTAGTA 200 

AATGCATCCT 2 50 

GAAAGAGCAA 300 

AATTACAGGA 3 50 

TACACGTAGG 4 00 

AAGACAACAA 4 50 

AATGGCTGGA 500 

AAATCGAAAG 550 

AAAAAATTCG 600 

ACACACTCCA 650 

CAGACGTAAC 700 

CCAGGAGACC 750 
789 



55 
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2) INFORMATION FOR SEQ ID NO: 722 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 798 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Clostridium septicum 

(B) STRAIN: ATCC 12464 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 722 

GTTCAGCAGC AGACGGTCCA ATGCCACAAA CAAGAGAACA TATACTACTA 5 0 

GCATCAAGAG TTGGTGTTGA CTATATCGTA GTATTCTTAA ACAAGGCAGA 100 

15 TATGGTAGAT GACGAAGAAT TATTAGAATT AGTAGAAATG GAAGTTAGAG 150 

AATTATTATC AGAATACAAC TTCCCAGGAG ATGATATTCC AGTAATCAAG 2 00 

GGATCAGCTT TAGTAGCATT AGAAAACCCA ACAGATGAAA AATCAATCGC 2 50 

TCCAATCTTA GAATTAATGG AAGCAGTAGA TAGCTACATT CCAACACCAG 3 00 

AAAGAG CAAC AGATAAGCCA TTCTTAATGC CAGTAGAAGA TGTATTCACA 3 50 

2 0 ATAACTGGTA GAGGAACAGT TGCAACAGGA AGAGTTGAAA GAGGAGTTCT 4 00 

TCATGTAGGA GACGAAGTAG AAATCGTTGG ATT AT C AGAA GAAAGCAGAA 450 

AAGTAGTAGT AACAGGAATA GAAATGTTCA GAAAGTTACT AGACGAAGCA 5 00 

CAAGCTGGAG ATAATGTTGG AGTACTTTTA AGAGGTGTTC AAAGAACAGA 55 0 

TATCGAAAGA GGTCAAGTAT TAGCAAAGAC TGGATCAGTT AAGC CAC AC A 60 0 

2 5 GCAAGTTCGT AGGTCAAGTA TACGTACTTA AGAAAGAAGA AGGTGGAAGA 6 50 

CATACTCCAT TCTTCGATGG ATACAGACCA CAATTCTACT TCAGAACAAC 7 00 

AGACGTTACT GGATCAATCA AATTACCAGA CGGAATGGAA ATGGTTATGC 750 

CAGGAGACCA CATTGATATG AACGTTGAAT TAATCACTCA AGTAGCAA 798 

30 

2) INFORMATION FOR SEQ ID NO: 72 3 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 799 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

4 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium tertium 

(B) STRAIN: ATCC 14573 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 723 

GTTCAGCAGC AGATGGTCCA ATGCCTCAAA CAAGAGAACA CATACTACTA 50 

GCTTCAAGAG TTGGTGTTGA CTACATAGTT GTTTTCTTAA ACAAGGCAGA 100 

50 TATGGTAGAT GACGAAGAAT TATTAGAATT AGTTGAAATG GAAGTAAGAG 150 

AATTATTATC AGAATACAAC TTCCCAGGAG ATGATATTCC AGTAATAAAG 200 

GGTTCAGCTT TACAAGCATT AGAAAACCCA ACAGATGAAA AAGCAATCGC 250 

TCCAATCCTT GAGTTAATGG AAGC TGTAGA TAGCTACATT CCAACTCCAG 3 00 

AAAGAGCAAC AGATAAGCCA TTCTTAATGC CAGTAGAAGA TGTATTCACA 3 50 

55 ATCACTGGTA GAGGAACAGT TGCTACAGGA AGAGTTGAAA GAGGAGTTCT 4 00 

TCACGTAGGA GACGAAGTAG AAATCGTTGG ATTATCAGAA GACAGCAGAA 4 50 

AAGTAGTAGT AACAGGAATA GAAATGTTCA GAAAGTTACT AGACGAAGCG 500 

CAAGCTGGAG ACAACGTAGG AGTTCTTTTA AGAGGAGTTC AAAGAACTGA 550 

CATCGAAAGA GGT C AAGTTT TAGCAAAAGT TGGATCAGTT AAGC CA CAC A 6 00 

6 0 AGAAATTTGT AGGTCAAGTA TACGTACTTA AAAAAGAAGA AGGTGGAAGA 65 0 



545 
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CATACTCCAT TCTTCGATGG ATACAGACCA CAATTCTACT TCAGAACAAC 
AGAYGTTACT GGTTCAATCA AGTTACCAGA TGGAATGGAA ATGGTTATGC 
CAGGAGACCA CATTGATATG AACGTTGAAT TAATCAC TCA AGTAGCTAT 



2) INFORMATION FOR SEQ ID NO: 724 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 801 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium tetani 

(B) STRAIN: ATCC 19406 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 724 

TAGTAAGTGC AGCAGATGGT CCAATGCCAC AAACAAGAGA ACACATACTA 
TTAGCATCCA GAGTTGGAGT TGAGCACATA GTAGTATTCT TAAATAAAGC 
AGAC CAAGTA GATGACGCAG AGTTAATCGA ATTAGTAGAA ATGGAAGTAA 
GGGAATTAAT GAACGAATAC GGATTCCCAG GAGATGACGC ACCAGTAGTA 
GTAGGATCCG CATTAAAAGC ATTAGAAAAT CCAGAAGATG ATGCAGCAAC 
ACAATGCATA ATGGACTTAA TGGCAGCAGT AGATGAATAT ATACCAACAC 
CAGAAAGAGC AACAGATAAG CCATTCTTAA TGCCAGTAGA AGATATCTTC 
ACAATCACAG GAAGAGGAAC AGTTG C AAC A GGAAGAGTAG AAAGAGGAAT 
TCTAAAAGTA GGAGACGAAA TAGAAATCGT AGGATTAAGT GATGAAAGCA 
AGAAAT CAGT AATCACAGGA ATAGAAATGT TCAGAAAACT ATTAGATGAA 
GCACAAGCAG GAGATAACAT CGGAGCATTA TTAAGAGGTG TTCAAAGAGA 
TGAAATCCAA AGAGGTCAAG TATTAGCAGC AACAGGATCA GTAAAAC C AC 
ATAAGAGTTT TACAGGTCAZV GTATATGTAT TAAAGAAAGA AGAAGGAGGA 
AGACACACTC CATT CTTTAA CGGATACAGA CCACAATTCT ACTTTAGAAC 
AACAGACGTA ACAGGTTCAA TCGCACTACC AGAAGGAGTA GAAATGGTAA 
TGCCAGGAGA CCACATAGAC ATGAAGGTAG AATTAATAAC AAGAGTAGCA 
A 



2 ) INFORMATION FOR SEQ ID NO: 725 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 633 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcua malodoratua 

(B) STRAIN: ATCC 4 3197 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 725 

TATGCCTCAA ACTCGTGAAC ACATCTTGTT ATCTCGTAAC GTTGGTGTTC 
CTTACATCGT TGTATTCTTA AACAAAATGG ATATGGTTGA TGATGAAGAA 
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TTACTAGAAT 
CTTCCCAGGC 
TAGAAGGCGA 
GTTGATGAAT 
GATGCCAGTC 
CTGGTCGTGT 
GTTGGTATTG 
GTTCCGTAAA 
TGTTACGTGG 
AAACCAGCTT 
TTT AAC T AAA 



TAGTTGAAAT 
GACGACACTC 
TGCTTCATAC 
ATATCCCAAC 
GAAGATGTAT 
TGAACGTGGA 
CTGAAGCAAC 
TTGTTAGATT 
TGTTGCACGT 
CAATCACTCC 
GAAGAAGGCG 



GGAAGTTCGT 
CAGTTATCGC 
GAAGAAAAAA 
ACCAGTTCGT 
TCTCAATCAC 
CAAGTTCGCG 
TGCTAAAACA 
ACGCTGAAGC 
GAAGACATCC 
ACATACAAAA 
GACGTCATAC 



GACTTATTGT 
TGGTTCAGCT 
TCTTAGAATT 
GATACTGACA 
TGGTCGTGGA 
TTGGTGACGA 
ACTGTTACAG 
AGGCGATAAC 
AACGTGGACA 
TTCTCTGCAG 
TCC 



CAGAATACGA 
TTGAAAGCTT 
AATGGCTGCT 
AACCATTCAT 
ACTGTTGCAA 
AGTTGAAATC 
GTGTTGAAAT 
ATTGGTGCAT 
AGTATTGGCT 
AAGTTTACGT 



2) INFORMATION FOR SEQ ID NO: 726 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 623 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus sulfureus 

(B) STRAIN: ATCC 4 9903 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 72 6 

CACGTGAACA CATGTTGTTA TCTCGTAACG TAGGTGTTCC TTACATCGTT 
GTATTCTTAA ACAAAATGGA TATGGTTGAT GACGAAGAAT TATTAGAATT 
AGTAGAAATG GAAGTTCGTG ACTTATTATC AGAATACGAT TTCCCAGGCG 
ATGACACTCC AGTTGTTGCA GGTTCTGCTT TGAAAGCTTT AGAAGGCGAC 
GCTTCTTACG AAGAAAAAAT CATGGAATTA ATGGCTGCAG TTGACGAGTA 
CATCCCAACT CCAACTCGTG ACACTGACAA ACCATTCATG ATGCCAGTTG 
AGGATGTATT CTCAATCACT GGACGTGGTA CTGTTGCTAC AGGTCGTGTT 
GAACGTGGAC AAGTTCGCGT TGGTGACGTT GTAGATATCG TTGGTATCGC 
TGACGAAACT GCTCAAACAA CTGTAACAGG TGTTGAAATG TTCCGTAAAT 
TATTAGACTA CGCTGAAGCA GGCGATAACA TCGGTGCTTT ATTACGTGGT 
GTTGCTCGTG AAGACATCCA ACGTGGACAA GTTTTAGCTA AACCAGCTTC 
AATCACTCCA CATACAAAAT TCTCTGCTGA AGTATACGTA TTAAGCAAAG 
AAGAAGGTGG ACGTCATACT CCA 



2) INFORMATION FOR SEQ ID NO: 72 7 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus garvleae 

(B) STRAIN: ATCC 49156 
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(xi) SEQUENCE DESCRJ 

CGGTCCTATG CCTCAAACTC 
GCGTACCATA CATCGTTGTT 
5 GAAGAATTGC TAGAATTAGT 
ATATGAC TTC CCAGGCGACG 
AAGCTCTTGA AGGAGATC CT 
GCTGCAGTTG ACGAATACGT 
ATTCATGATG CCAGTCGAAG 

10 TTGCTACAGG CCGTGTTGAA 
GAAATCGTTG GTATTGCTGA 
TGAAATGTTC CGTAAATTGT 
GTGCATTGCT ACGTGGGGTT 
TTGGCTAAAG CTGGTACAAT 

15 TTATGTTTTG ACAAAAGAAG 



PTION: SEQ ID NO: 727 

GTGAACACAT CTTGTTATCA 
TTCTTGAACA AAATGGATAT 
TGAAATGGAA GTTCGTGACC 
ATGTTCCTGT AATCGCTGGT 
TCATACGAAG AAAAAATCAT 
TCCAACTCCA GAACGTGATA 
ACGTATTCTC AATCACTGGA 
CGTGGACAAG TTCGCGTTGG 
CGAAACTGCT AAAACAACTG 
TAGACTATGC TGAAGCAGGG 
GCTCGTGAAG ACATCCAACG 
CACACCTCAT ACAAAATTCA 
AAGGTGGACG TCACACTCCA 



CGTAACGTTG 50 

GGTTGATGAC 10 0 

TATTGTCTGA 150 

TCTGCTTTGA 2 0 0 

GGAATTGATG 250 

CTGACAAACC 300 

CGTGGTACTG 350 

TGATGAAGTA 4 00 

TAACAGGTGT 450 

GATAACATTG 500 

TGGACAAGTA 550 

AAGCTGAAGT 600 

TTCTTC 64 6 



2) INFORMATION FOR SEQ ID NO: 728 

2 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycoplasma, pirum 

3 0 (B) STRAIN: ATCC 25960D 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 728 

GAGCAATTTT AGTTGTTTCT GCAACTGATG GTCCAATGCC TCAAACTCGT 50 

35 GAACAT AT CT TATTAGCACG CCAAGTTGGT GTTCCTAAAA TGGTTGTTTT 100 

CTTAAACAAA TGTGATGTTG CTTCTGATCC AGAAATGCAA GAATTAGTTG 150 

CTGAAGAAGT AAAAGACTTA TTAAAATCTT ATGGTTTTGA TGGTGACAAT 200 

ACTCCAATTA TTCGTGGTTC AGCATTAGAA GCATTAAATG GTAAACCTGA 250 

ATGAGAAGAA AAAATTAAAG AATTAATGAA GGCAGTGGAT GACACTATTC 300 

4 0 CTGATCCAGT TCGTGATACT GAAAAGCCAT TCTTGTTACC AATTGAAGAC 3 50 

GTAATGACAA TTACAGGTCG TGGTACTGTT GTTACAGGTC GTGTAGAACG 400 

TGGTACTCTA AAATTAAATG ATGAAGTTGA AATTGTTGGT TTAGGTGAAA 450 

CATTTAAATC TGTTGTAACA GGTATTGAAA TGTTCCGTAA AGAATTAGAT 500 

GAAGCTCGTG CTGGTGACAA TGCTGGTATT TTACTTCGTG GTGTTGACCG 550 

4 5 TGGT CAAGTA CAACGTGGTC AAGTTCTTGC TAAAC CAGGT TCTATTACTC 600 

CTCATACTAA ATTTAAAGCT GAAATTTATG CTTTGAAAAA AGAAGAAGGT 650 

GGTCGTCATA CTGCTTTCTT AAACGGTTAT CGTCCTCAAT TCTATTTCAG 70 0 

AACAACTGAT GTTACAGGTT CTATTAAATT AAAAGATGGA ACTGAAATGG 750 

TTATGC CTGG TGACAATACT GAAATCACTG TAGAATTAAT TTCACCAATT 800 

50 GCTTGTAAAA AGGAAGTAAG TTT 823 



2 ) INFORMATION FOR SEQ ID NO: 72 9 

55 

(i) SEQUENCE CHARACTERISTICS:. 

(A) LENGTH: 82 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
60 (D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Mycoplasma salivarium 

(B) STRAIN: ATCC 23064 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 9 

10 GGAGCAATCT TAGTTGTTGC TGCAAC TGAT GGTGCGATGC CTCAAACTCG 50 

TGAACAC GTT TTACTTGCAA AACAAGTTGG TGTTCCTAAA ATCGTTGTTT 100 

TCTTAAACAA AATCGATATG TTCAAGCCAG AAGAAAGAGC CGAAATGGTT 15 0 

GAAATGGTTG AAATGGACAT TCGTGACTTA TTAAACAAAT ACGACTTTGA 200 

TGGTGACAAT ACTCCTATTA TTGCTGGTTC AGCATTAAAA GCATTACAAG 250 

15 GTGATCCAGA ATATGAAAAG AATATTCTAG AACTTATGGA TGCAGTAGAC 3 00 

ACATACATTG ATGAACCTAC TCGTGATAAA GATAAACCAT TCTTAATGGC 3 50 

TGTAGAAGAC GTATTTACAA TTACTGGTCG TGGAACTGTT GCTACTGGTA 4 00 

GAGTAGAACG TGGTACATTA CATCTAAATG ATGAAGTTGA AATCGTTGGT 450 

CTACGTCCAA CAATTAAAAC TGTTGTTACT GGAATTGAAA TGTTCCGTAA 50 0 

20 AAATTTAAAA GAAGCTCAAG CTGGAGATAA TGCAGGACTT TTACTACGTG 550 

GAATTGATAG AGAT CAAGTA GAACGTGGAC AAGTTTTGGC CAAACCAAAA 600 

AGTATTATTC CTCACACAGA ATTTGAAGCT GCTGTGTATG TTCTAAAAGC 65 0 

TGAAGAAGGT GGACGTCACA CTCCATTCTT TGAACACTAT AAACCACAAT 700 

TTTACTTTCG TACAACCGAC GTTACTGGTG GAATTAAATT CAAACCTGGA 750 

25 CGTGAAATGG TTATGCCTGG CGAAAATGTT GAATTTACAG TTACTTTAAT 800 

TGCTCCTATT GCAGTTGAAG AAGGAA 826 



3 0 2) INFORMATION FOR SEQ ID NO: 73 0 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria polysaccharea 

(B) STRAIN: ATCC 43768 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 0 

45 

TGGTATGTTC CGCAGCYGAY GGYCCTATGC CTCAAACTCG CGAACACATC 5 0 

CTGYTGGCTC GCCAAGTAGG YGTACCTTAC ATCATCGTRT TCATGAACAA 100 

ATGCGACATG GTTGACGATG CCGAGCTGTT GGAACTGGTT GAAATGGAAA 150 

TCCGYGACCT GYTGTCMAGC TACGACTTCC CMGGCGACGA CTGCCCAATC 2 00 

50 GTACAAGGTT CTGCACTGAA AGCTTTGGAA GGYGAYGCTG GTTACGAAGA 250 

GAAAATCTTC GAAYTGGCTG CTGCTTTGGA CAGCTACATC CCAACTCCTG 300 

AGCGTGCTGT GGACAAACCK TTCYTGYTGC CTATCGAAGA CGTRTTCTCT 3 50 

ATCTCTGGYC GYGGTACMGT AGTAACYGGY CGTGTAGAGC GCGGTRTCAT 4 00 

CCACGTTGGT GACGAGATYG AAATCGTAGG TCTGAAAGAA ACCCAAAAAA 4 50 

55 CCACTTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA CGAAGGTCAA 5 00 

GCWGGTGACA ACGTAGGCGT ATTGYTGCGT GGTACCAARC GTGAAGACGT 550 

AGAGCGTGGT CAAGTATTGG CYAAACCAGG YACCATYACT CCSCACACCA 6 00 

AATTCAAAGC AGAAGTATAC GTAC TGAGCA AAGAAGAGGG TGGTCGYCAC 650 

ACTCCATTCT TCGCYAACTA CCGYCCMCAA TTCTACTTCC GTACYACTGA 700 

60 CGTAACYGGT GCAGTTACTT TGGAAGAAGG TGTAGAAATG GTAATGC CAG 7 50 
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GTGAGAACGT AACCATTACY GTAGAACTGA TTGCGCCTAT CGCTAGGAAG 8 00 

AAGGTCTGCG 810 



2) INFORMATION FOR SEQ ID NO: 731 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 

serotype enteri tidis 

(B) STRAIN: ATCC 13076 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 731 

CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC GTGAGCACAT 5 0 

CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG TTCCTGAACA 100 

AATGCGACAT GGTTGATGAC GAAGAGC TGC TGGAACTGGT TGAAATGGAA 15 0 

GTTCGYGAAC TGCTGTCTCA GTACGACTTC CCGGGCGACG ACACTCCGAT 2 00 

CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA GAGTGGGAAG 2 50 

CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT CCCGGAACCA 3 00 

GAGCGTG C GA TTGACAAGCC GTTCCTGCTG CCGATCGAAG ACGTATTCTC 3 50 

CATCTC CGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA CGCGGTATCA 4 00 

TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA GACTCAGAAG 4 50 

TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG ACGAAGGCCG 5 00 

TGCCGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA CGTGAAGAAA 550 

TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA GCCGCACACC 600 

AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG GCGGC CGTCA 650 

CACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC CGTACTACTG 700 

ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT GGTAATGCCG 750 

GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA TCG CAATGGA 8 00 

CGACGGTCTG CGT 813 



2) INFORMATION FOR SEQ ID NO: 732 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 

serotype gallinarum 

(B) STRAIN: ATCC 9184 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 732 

CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC GTGAGCACAT 50 
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CCTGCTGGGT CGTCAGGTAG 

AATGC GACAT GGTTGATGAC 

GTTCGYGAAC TGCTGTCTCA 

CGTTCGTGGT TCTGCTCTGA 

5 CGAAAATCAT CGAACTGGCT 

GAGCGTGCGA TTGACAAGCC 

CATCTCCGGT CGTGGTACCG 

TCAAAGTGGG CGAAGAAGTT 

TCTACCTGTA CTGGCGTTGA 

10 TGCCGGTGAG AACGTAGGTG 

TCGAACGTGG TCAGGTACTG 

AAGTT CGAAT CTGAAGTGTA 

CACTCCGTTC TTCAAAGGCT 

ACGTGACTGG CACCATCGAA 

15 GGCGACAACA TCAAAATGGT 

CGACGGTCTG CG 



GCGTTCCGTA CATCATCGTG 
GAAGAGCTGC TGGAACTGGT 
GTACGACTTC CCGGGCGACG 
AAGCGCTGGA AGGCGACGCA 
GGCTTCCTGG ATTCTTACAT 
GTTCCTGCTG CCGATCGAAG 
TTGTTACCGG TCGTGTAGAA 
GAAATC GTTG GTATCAAAGA 
AATGTTCCGC AAACTGCTGG 
TTCTGCTGCG TGGTATCAAA 
GCTAAGCCGG GCACCATCAA 
CATTCTGTCC AAAGATGAAG 
ACCGTCCGCA GTTCTACTTC 
CTGCCGGAAG GCGTAGAGAT 
TGTTACCCTG ATCCACCCGA 



1 1 CC- 1 CjAACA 


100 


rr>/~i TV 7V TV TP^TV TV 


150 


TV /""'TV #"Tp/"*l/"i/""*TV m 

ACACTCCGAT 


200 


TV /~* rni^i/n O TV TV /""< 

GrAGTGGGAAG 


250 


CCCGGAACCA 


3 00 


ACGTATTCTC 


350 


C GC GGT ATC A 


400 


GAC T CAGAAG 


450 


ACGAAGGCCG 


500 


f"^ /™tm /""I TV TV /-l TV TV TV 

CGTGAAGAAA 


550 


r~* r~i/~i/~i /-i7< tv /■•/""» 
CjC CCjCAC acc 


60 0 


GCGGCCGTCA 


650 


CGTACTACTG 


700 


GGTAATGCCG 


750 


TCGCAATGGA 


800 




812 



20 2) INFORMATION FOR SEQ ID NO: 733 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype paratyphi B 

(B) STRAIN: ATCC 8759 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 733 

TGGTTGTTGC TGCGACTGAC GGYCCGATGC CGCAGACCCG TGAGCACATC 50 

CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT TCCTGAACAA 100 

ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT GAAATGGAAG 150 

4 0 TTCGYGAACT GCTGTCTCAG TACGACTTCC CGGGCGACGA CACTCCGATC 2 00 

GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG AGTGGGAAGC 2 50 

GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATT CCGGAACCAG 300 

AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA CGTATTCTCC 350 

ATCTCTGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC GCGGTATCAT 4 00 

45 CAAAGTGGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAG ACTCAGAAGT 4 50 

CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA CGAAGGCCGT 5 00 

GCYGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC GTGAAGAAAT 550 

CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAG CCGCACACCA 600 

AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG CGGCCGTCAT 650 

50 ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC GTACTACTGA 70 0 

CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG GTAATGCCGG 7 50 

GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT CG CAATGGAC 8 00 

GACGGTCTGC GTTT 814 



55 



2) INFORMATION FOR SEQ ID NO: 734 



(i) SEQUENCE CHARACTERISTICS : 
60 (A) LENGTH: 828 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 

serotype virchow 

10 (B) STRAIN: ATCC 51955 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 734 

GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACCCG 50 

15 TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 10 0 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGACTTCC CGGGCGACGA 2 00 

CACTCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 250 

AGTGGGAAGC GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATT 3 00 

20 CCGGAACCAG AGCGTGCGAT TGACAAGC CG TTCCTGCTGC CGATCGAAGA 35 0 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC 400 

GCGGTATCAT CAAAGTGGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAG 450 

ACT CAGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 50 0 

CGAAGGCCGT GCCGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC 55 0 

25 GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CAC CATCAAG 60 0 

CCGCACACCA AGTT CGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 70 0 

GTACTACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

3 0 CGCAATGGAC GACGGTCTGC GTTTCGCA 82 8 



35 



40 



45 



2 ) INFORMATION FOR SEQ ID NO: 735 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia grimesii 

(B) STRAIN: ATCC 14460 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 5 



50 GGCGCTATCC 
TGAGCACATC 
TCATGAACAA 
GAAATGGAAG 
CCTGCCAGTT 

55 AGTGGGAAGC 
CCAGAACCAG 
CGTATTCTCC 
GCGGTATCGT 
ACCGTTAAGT 

60 CGAAGGCCGT 



TGGTTGTTGC 
CTGCTGGGTC 
ATGCGACATG 
TTCGTGAACT 
GTTCGTGGTT 
TAAAATCATC 
AGCGTGCTAT 
ATCTCCGGYC 
TAAAGTTGGC 
CTACCTGTAC 
GCTGGTGAGA 



TGCGACTGAT 
GTCAGGTTGG 
GTTGATGATG 
TCTGTCTGCT 
CAGCGCTGAA 
GAACTGGCTG 
CGACAAGCCG 
GTGGTACYGT 
GAAGAAGTTG 
TGGCGTTGAA 
ACGTAGGTGT 



GGCCCAATGC 
CGTTCCTTTC 
AAGAGCTGCT 
TATGACTTCC 
AGCACTGGAA 
GCTACCTGGA 
TTCCTGCTGC 
AGTTACCGGT 
AAATCGTTGG 
ATGTTCCGCA 
TCTGCTGCGT 



CACAGACCCG 50 

ATCAT CGTAT 100 

GGAACTGGTA 150 

CTGGTGATGA 2 00 

GGCGAAGCTG 250 

TTCTTACATC 3 00 

CAATCGAAGA 3 50 

CGTGTAGAGC 4 00 

TATCAAAGAC 450 

AACTGCTGGA 500 

GGTAT CAAGC 550 
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GTGAAGACAT CGAACGTGGT CAGGTACTGG 
CCACACACCA AATTCGACTC AGAAGTTTAC 
TGGTCGTCAC ACTCCATTCT TCAAAGGCTA 
GTACAACTGA CGTGAC CGGT ACCATCGAAC 
5 GTAATGCCAG GCGATAACGT GAACATGGTT 
CGCGATGGAC GACGGTCTGC GTTTC 



CTAAACCAGG TTCAATCAAG 600 
ATCCTGAGCA AAGAAGAAGG 650 
CCGTCCACAG TTCTACTTCC 700 
TGCCAGAAGG CGTAGAGATG 750 
GTAACCCTGA TTCACCCAAT 8 00 

825 



10 2) INFORMATION FOR SEQ ID NO: 736 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM : Clostridium difficile 

(B) STRAIN: ATCC 9689 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 6 



25 



TATTAGTTTG TTCAGCAACA GATGGACCAA TGCCACAAAC AAGAGAGCAT 5 0 

ATACTATTAT CAAGACAAGT TGGAGTAC C A TATATAGTAG TATTCTTAAA 100 

CAAATGTGAC ATGGTAGATG ATGAAGAGTT ATTAGAGTTA GTAGAGATGG 150 

AAGTAAGAGA TTTATTAACA GAATATGATT TCCCAGGAGA TGACACTCCA 200 

3 0 ATAGTAAGAG GTTCAGCATT AATGGCATTA GAAGATCCAA AGAGYGAGTG 250 
GGGAGATAAG ATAGTAGAAT TATTCGAGCA AATAGATGAG TATATACCAG ' 3 00 

CTCCAGAGAG AGATACAGAT AAACCATTCT TAATGCCAGT AGAGGACGTA 350 

TTCTCAATCA CAGGAAGAGG AACAGTTGCA ACAGGAAGAG TGGAAAGAGG 4 00 

AGTACTAAAA GTACAAGACG AAGTAGAKTT AGTAGGATTA ACAGAAGCAC 450 

35 CAAGAAAAGT AGTAGTAACA GGAGTAGAGA TGTTCAGAAA ATTATTAGAC 500 

CAAGCACAAG CAGGGGATAA TATAGGAGCA TTATTAAGAG GAGTACAAAG 550 

AAACGAGATA GAAAGAGGAC AAGTACTAGC AAAGACTGGA TCAGTAAAGG 60 0 

CACACACAAA GTTTACAGCA GAAGTATATG TACTTAAAAA AGAAGARGGT 650 

GGAAGACATA CACCATTCTT TGATGGATAT AGACCACAAT TCTATTTCAG 700 

40 AACAACAGAC GTAACAGGAG CTTGTAAGTT ACCAGAAGGA ATAGAGATGG 750 

TAATGCCTGG AGATAACGTA ACAATGGAAG TAGACTTAAT AAACTCAA 798 



45 2) INFORMATION FOR SEQ ID NO: 737 

(i)~ SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Burkholderia pseudomallei 

(B) STRAIN: 1026B 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 7 
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GTCAACATGA 
CTCCGTGTTC 
ACCACGAAAT 
GGCCAGATGA 
CCTCACGATG 
TCGTCGACAA 
CTGCTCGGCC 
AGAAATGGGC 
TCACGTCGGT 



TGGAGCTCAT 
GCGGGCGTGG 
GAAGGACTCG 
ACGAGCCGCC 
GCCGAGCACT 
CATCTACCGT 
GTATGCCGTC 
AAGCTGCAAG 
T 



CAACAACATC 
GCGAGCGTAC 
AACGTTCTCG 
GGGCAACCGT 
TCCGTGACGA 
TTCACGCTGG 
GGCAGTGGGC 
AGCGCATCAC 



GCGAAGGAGC 
CCGTGAAGGG 
ACAAGGTCGC 
CTGCGCGTGG 
AGGCCTCGAC 
CCGGTACCGA 
TATCAGCCGA 
GTCGACGAAG 



ACGGCGGTTA 
AACGACTTCT 
GCTGGTGTAC 
CGCTGACGGG 
GTGCTGTTCT 
AGTGTCGGCG 
CGCTGGCTGA 
AAGGGCTCGA 



2) INFORMATION FOR SEQ ID NO: 738 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 94 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium hi ferment axis 

(B) STRAIN: ATCC 63 8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 738 

TACAAGAGCT TATTAACAAT ATAGCTACTC AACACGGTGG TATATCAGTA 
TTCGCAGGTG TTGGAGAGAG AACAAGAGAA GGTAACGACT TATTCCATGA 
GATGAGCGAT ACAGGAGTTA TAAATAAAAC AGCTCTAGTA TT CGGAC AAA 
TGAATGAGCC ACCTGGAGCA AGAATGAGAG TTGCTTTAAC TGGTCTTACA 
ATGGCTGAAT ACTTCAGAGA TCAACAAGGG CAAGACGTTT TATTATTCGT 
AGATAATATA TTCCGTTTCA CTCAAGCAGG ATCTGAGGTT TCTGCACTTC 
TTGGACGTAC TCCATCAGCA GTTGGATACC AACCAACATT AGCAACAGAG 
AT GGGTAGAT TACAAGAGAG AATAACATCT ACAAATAAAG GGTC 



2) INFORMATION FOR SEQ ID NO: 739 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 94 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium beijerincki (deposited 

Clostridium butyricum) 

(B) STRAIN: ATCC 8260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 739 

TTAATAAACA ACATAGCTAA ACAACATGGT GGTTTAT CAG TATTTACTGG 
AGTTGGTGAA AGATCAAGAG AAGGTAATGA CTTATAT CAT GAAATGAGAG 
AGTCAGGAGT TATTGATAAG ACAGCATTAG TATTTGGACA AATGAATGAG 
CCACCGGGTG CCAGAATGAG AGTTGCATTA ACAGGTCTTA CTATGGCAGA 
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GTATTTTAGA GATAAAGGTC AAGATGTGTT 
TCAGATATAC TCAAGCAGGT TCAGAGGTTT 
CCTTCAGCGG TTGGATATCA GCCAACACTT 
TCAGGAAAGA ATTACATCAA CAGTTAATGG 



ACTATTCATA GATAACATAT 250 

CAGCATTACT TGGAAGAACA 3 00 

GCAACTGAAA TGGGTGCACT 3 50 

TTCTATTACG TCAG 394 



2) INFORMATION FOR SEQ ID NO: 740 



10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 93 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

<A) ORGANISM: Clostridium difficile 
20 <B) STRAIN: ATCC 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 740 



TTATAAACAA TATTGCTAAG 

2 5 GTAGGAGAAA GAACAAGAGA 

GTCTGGAGTT ATAAATAAAA 
CACCTGGAGC GAGAATGAGA 
CATTTTAGAG ATGAGCAAGG 
ATTCCGTTTC ACACAAGCTG 

3 0 TGCCATCAGC TGTTGGTTAT 

CTTCAAGAGA GAATAACATC 



CAACATGGTG GTATTTCTGT 
AGGTAACGAC CTTTATGGCG 
CAGCTCTAGT ATTTGGTCAA 
GTTGCTTTAA CTGGACTTAC 
ACAAGACGTT TTACTTTTCG 
GTTCAGAAGT TTCAGCACTT 
CAGCCAACAT TAGCTACTGA 
AACTAAGAAA GGTTCAATAA 



ATTTTCAGGA 5 0 

AAATGAGTGA 10 0 

ATGAATGAAC 150 

AATGGCAGAA 2 00 

TTGATAATAT 25 0 

CTAGGACGTA 3 00 

AATGGGTGCA 350 

CAT 3 93 



35 2) INFORMATION FOR SEQ ID NO: 741 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 98 bases 

(B) TYPE: Nucleic acid 

4 0 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

4 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium r anno sum 

(B) STRAIN: ATCC 25582 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 741 

50 

TTGATTCAAG AATTCATTAA TAACATTGCT ACAGAACATG GTGGTTTATC 50 

AGTTTTTGCT GGAGTTGGTG AACGTAGCCG TGAAGGTAAT GATTTATATT 100 

ATGAAATGAA GGAAAGTGGT GTTTTAT CTA AAACAACACT AGTATTTGGA 150 

CAGATGAATG AACCCCCAGG AGCTCGTTTA AGAGTTGCTT TAACGGGTCT 2 00 

55 TACTATGG CA GAAGAATTCC GTGATGAACA AGGTCAGGAT GTCTTATTAT 2 50 

TCATCGATAA TATTTTCCGT TTTACTCAAG CTGGATCTGA AGTATCTGCC 300 

TTACTTGGAC GGGTACCATC ACAAGCTGGG TATCAGCCAA CTTTAGCAAC 35 0 

CGAAATGGGT GCTTTACAAG AACGGATTAC ATCAACTAAA AAAGGATC 3 98 

60 
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2) INFORMATION FOR SEQ ID NO: 742 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 380 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Clostridium septicum 

(B) STRAIN: ATCC 12964 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 742 

TAGC TAAGGA ACACGGTGGA CTTTCAGTAT TCACAGGTGT TGGAGAAAGA 50 

TCAAGAGAAG GTAATGATTT ATATTACGAA ATGAAAGAAT CAGGAGTTAT 100 

AGACAAGACA GCTCTAGTGT TTGGACAAAT GAATGAAT CT CCAGGAGCTA 150 

GAATGAGAGT ATCTTTAACA GGATTAACTA TGGCTGAATA TTTCAGAGAT 2 00 

CAAGGTCAAG ATGTGCTTTT ATTCATAGAT AACATATTTA GATTTACTCA 250 

AGCTGGATCA GAAGTAT CGG CTTTACTTGG AAGAATACCA TCAGCAGTTG 3 00 

GTTATCAACC AACACTAGCA ACTGAAATGG GTGCACTTCA AGAAAGAATT 350 

25 ACTTCAACTA AAAATGGATC AATAACTTCA 380 



15 



20 



30 



35 



40 



2) INFORMATION FOR SEQ ID NO: 743 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 89 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium tertium 

(B) STRAIN: ATCC 14573 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 743 



45 TTAATAAATA 
AGTTGGAGAA 
AGT CAGGGGT 
TCACCAGGAG 
ATATTTTAGA 

50 TTAGATTTAC 
CCTTCAGCAG 
TCAAGAAAGA 



ATATAGCAAA 
AGGTCAAGAG 
TATAGATAAG 
CAAGAATGAG 
GATCAAGGTC 
TCAAGCGGGA 
TTGGATATCA 
ATAACATCAA 



AGAGCATGGT 
AAGGTAACGA 
ACAGCTTTAG 
AGTTTCATTA 
AAGACGTTCT 
TCAGAAGTTT 
ACCAACTCTT 
CAAAGAATGG 



GGTCTTTCTG 
CTTATATTAT 
TATTTGGACA 
ACTGGATTAA 
TTTATTTATA 
CTGCGTTATT 
GCAACTGAAA 
ATCAATCAC 



TATTTACAGG 50 

GAAATGAAAG 100 

AATGAATGAA 150 

CTATGGCTGA 2 0 0 

GATAATATAT 250 

AGGAAGAATT 300 

TGGGAGCACT 35 0 
389 



2) INFORMATION FOR SEQ ID NO: 744 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 843 bases 
60 (B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cojnamonas acidovorans 

(B) STRAIN: ATCC 15668 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 744 

TTCCCCCGCA CGCATGCCCA AGGTGTTCGA TGCCCTGAAG CTCGACGGCT 50 

CGGCCCTGAC GCTGGAAGTG CAGCAACTGC TGGGTGACGG CGTTGTGCGT 10 0 

ACCATCGCCC TGGGTTCGTC CGACGGTCTG CGTCGCGGCC TGATGGTGTC 15 0 

15 CAACACCGGC AACCCCATCA CCGTGCCCGT GGGCAAGGCG ACGCTGGGTC 2 00 

GCATCATGGA CGTGCTGGGC AATCCCATCG ACGAACGTGG TCCCGTGGAT 2 50 

CAGGCGCTGA CGGCTCCCAT CCACCGCAAG GCACCGGCTT ATGACGAGCT 3 00 

GTCGCCTTCG CAGGAACTGC TGGAAACCGG CATCAAGGTG ATCGACCTGA 3 50 

TCTCGCCCTT CGCCAAGGGC GGCAAGGTGG GTCTGTTCGG TGGCGCCGGT 4 00 

20 GTGGGCAAGA CCGTGAACAT GATGGAACTC ATCAACAACA TCGCCAAGGG 4 50 

CCACGGTGGT CTGTCGGTGT TCGCCGGTGT GGGTGAACGT ACCCGCGAAG 50 0 

GCAATGACTT CTATCACGAA ATGTCGGACG CCGGCGTGGT CAACCAGGAG 550 

TCGCTGAACG ACTCCAAGGT GGCCATGGTC TACGGCCAGA TGAACGAACC 600 

CCCGGGCAAC CGTCTGCGCG TGGC GCTGAC CGGCCTGACC ATGGCCGAAG 650 

25 CCTTCCGTGA CGAAGGCAAG GACGTGCTGT TCTTCGTGGA CAACATCTAC 700 

CGCTACACGC TGGCCGGTAC CGAAGTGTCC GCTCTGCTGG GTCGCATGCC 75 0 

TTCCGCCGTG GGCTACCAGC CCACGCTGGC CGAGGAAATG GGCCGCCTGC 8 00 

AAGAGC GCAT CACCTCGACC AAGGTCGGTT CGATCACTTC CAC 843 



30 



2) INFORMATION FOR SEQ ID NO: 745 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

4 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: JCleJbsiella pneumoniae subsp ■ 
rhinoscl eroma ti s 
45 (B) STRAIN: ATCC 13884 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 5 

GCCGTACCAC GCGTGTACGA AGCCCTTGAG GTACAGAATG GTAATGAAGT 50 

50 TCTGGTGCTG GAAGTTCAGC AGCAGCTGGG CGGCGGTATC GTACGTACCA 100 

TCGCCATGGG TTCTTCTGAT GGTCTGCGCC GCGGTCTGGA TGTAAAAGAC 150 

CTCGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACGC TGGGTCGTAT 2 00 

CATGAACGTA CTGGGTCAAC CGGTTGACAT GAAAGGCGAC ATCGGCGAAG 2 50 

AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CGTCCTATGA AGAGCTGTCC 3 00 

55 AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCC AAGGGCGGTA AAGTTGGTCT GTTCGGCGGT GCGGGTGTAG 400 

GTAAAACTGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAGCGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAT AAAGTATCCC 550 

60 TGGTGTACGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCG 600 
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CTGACCGGCC 
ACTGCTGTTC 
TATCCGCGCT 
CTGGCGGAAG 
CGGTTCTATC 



TGACCATGGC 
GTCGATAACA 
GCTGGGTCGT 
AGATGGGCGT 
ACCTCCGTA 



TGAGAAATTC 
TCTATCGTTA 
ATGCCTTCAG 
TCTGCAGGAA 



CGTGACGAAG 
CACCCTGGCC 
CGGTAGGTTA 
CGTATCACCT 



GTCGTGACGT 
GGTACTGAAG 
TCAGCCGACC 
CCACCAAAAC 



2) INFORMATION FOR SEQ ID NO: 746 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria canis 

(B) STRAIN: ATCC 14687 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 6 

GCGATTCTAT TCCGCGCGTA TATGATGCTC TTAAACTAGT GGATAGAGAA 
CTGACGCTTG AAGTACAACA ACAGTTGGGT GATGGTGTCG TTCGTACTAT 
TGCGATGGGT AGTTCCGACG GCCTCAAACG AGGTTTGGCG GTAGTTAACA 
CCGGTGCTCC AATTACAGTG CCTGTGGGCA AAGCAACATT AGGCCGTATT 
ATGGAC GTAT TAGGTAATCC GGTTGATGAA GCTGGTCCGA TTGGCTCCGA 
GCAAACCCGA GCAATCCACC AACCTGCTCC TAAGTTCGAC GAGCTTTCTA 
GCGCCACAGA GATTTTGGAA ACAGGTATTA AAGTAATTGA TTTGCTTTGC 
CCGTTTGCCA AAGGCGGTAA AGTAGGTTTG TTTGGTGGTG CGGGAGTGGG 
CAAAAC CGTA AATATGATGG AGTTGATTAA CAACATCGCG AAAGCACACA 
GCGGTTTGTC TGTATTTGCC GGTGTGGGTG AACGGACGCG TGAAGGTAAT 
GACTTTTATC ATGAGATGAA AGATT CCAAT GTATTAGATA AAGTAGCCAT 
GGTTTACGGT CAGATGAACG AGCCTCCCGG TAACCGTTTG CGCGTTGCGC 
TAACTGGCTT GTCTATGGCC GAATTCTTCC GTGACGAGAA AGATGAAAAC 
GGTAAAGGCC GTGATGTATT GTTCTTTGTA GACAATATTT AC CGCTATAC 
CCTAGCCGGT ACAGAAGTAT CTGCATTGCT TGGCCGTATG CCTTCGGCAG 
TAGGTTATCA GCCGACGTTG GCAGAGGAAA TGGGCCGCTT GCAAGAGCGT 
ATTACYTCAM CCCAAACAGG CTCT 



2) INFORMATION FOR SEQ ID NO: 747 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria cinerea 

(B) STRAIN: ATCC 14685 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 747 
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CGCGACGCTA TCCCGCATGT TTACGATGCC CTGAAATTGG ACGAGAACGG 5 0 

TCTGACTCTG GAGGTTCAAC AGCTTCTGGG CGACGGCGTT GTCCGTACTA 100 

TTGCAATGGG TAGTTCAGAC GGCCTTAAAC GCGGTATGTC TGTAAGCAAT 150 

ACTGGTGCGC CAATCACTGT GCCGGTAGGT AAAGGTACAT TGGGTCGTAT 2 00 

5 TGTCGACGTA TTGGGTACGC CTGTTGATGA AGCAGGTCCG ATCGATACCG 250 

ACAAAAGCCG TGCCATTCAC CAAACTGCTC CGAAATTCGA CGAGTTGTCT 3 00 

TCAGCTACCG AATTGTTGGA AACCGGTATT AAAGTGATCG ACTTGCTGTG 350 

TCCGTTTGCT AAAGGCGGTA AAGTAGGTCT GTTCGGTGGT GCCGGTGTGG 400 

GCAAAACCGT GAACATGATG GAATTGATCA ACAACATCGC CAAAGCGCAC 45 0 

10 AGCGGTCTGT CCGTGTTCGC AGGTGTGGGT GAGCGTACCC GTGAAGGTAA 500 

CGACTTCTAC CACGAGATGA AAGATTCCAA CGTATTGGAT AAAGTAGCCA 55 0 

TGGTGTATGG CCAAATGAAC GAACCTCCGG GCAACCGTCT GCGCGTTGCT 600 

TTGACCGGTT TGACTATGGC CGAATACTTC CGTGACGAAA AAGACGAAAA 65 0 

CGGTAAAGGC CGCGACGTAT TGTTCTTCGT TGACAACATC TACCGTTACA 700 

15 CTTTGGCCGG TACTGAAGTA TCTGCACTGT TGGGCCGTAT GCCTTCTGCA 75 0 

GTGGGTTACC AACCGACATT GGCTGAAGAA ATGGGTCGTT TGCAAGAGCG 80 0 

TATTAC CTCT ACCCAAACCG GTTCCATTAC T 831 



20 



30 



2) INFORMATION FOR SEQ ID NO: 748 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 62 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria cuniculi 

(B) STRAIN: ATCC 146 8 8 

3 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 8 

CCGTGGCCAA GTAC CACAAA TTTATGACGC ACTGAGTGTT GATGGCACCG 50 

AAACAACCTT GGAAGTTCAA CAGCAGTTGG GTGATGGCGT GGTGCGTACC 100 

ATTGCGATGG GTTCAACCGA AGGCTTGAAG CGTGGTTTGA CTGTATCTAA 150 

4 0 CTCTGGTGCA CCGATTTCTG TGCCAGTGGG TCAAGCGACT TTGGGTCGTA 2 00 

TTATGGATGT GTTGGGTCGT CCAATCGACG AGGCAGGTCC TGTAAATGCT 250 

CAAGAAAAAT GGTCAATTCA CCGTGAAGCA CCAAGCTATG ATGAGCAATC 3 00 

AAACTCAACT GAGCTGCTAG AAACAGGCAT CAAAGTAATT GATTTGCTTT 3 50 

GCCCATTTGC TAAAGGTGGT AAAGTTGGCT TGTTCGGTGG TGCAGGTGTG 4 00 

4 5 GGTAAAACCG TGAATATGAT GGAGCTGATT AATAATATCG CTCTGAAGCA 4 50 

TTCAGGTCTT TCTGTTTTTG CAGGTGTTGG TGAGCGTACT CGTGAGGGTA 500 

ACGATTTTTA TCACGAAATG CAAGAAGCAG GCGTTGTTAA TACCGAAGAC 550 

TTCACCAAGT CAAAAGTAGC GATGGTTTAT GGTCAGATGA ATGAGCCACC 600 

AGGAAACCGT TTGCGTGTTG CATTGACAGG CTTGACGATG GCAGAATATT 650 

50 TCCGTGACGA AAAAGATGAA GCAACAGGCA AAGGGCGTGA TGTTCTATTG 70 0 

TTCGTTGATA ACATCTATCG TTACACACTG GCTGGTACGG AAGTGTCAGC 750 

ACTTCTAGGT CGTATGCCAT CAGCAGTAGG TTATCAACCG ACTCTGGCTG 800 

AAGAAATGGG TGCGTTGCAA GAGCGTATTA CCTCAACGCA ATCGGGTTCG 850 

ATCACTTCGG GG 862 

55 



60 



2) INFORMATION FOR SEQ ID NO: 74 9 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria elongata subsp. elongaza 

(B) STRAIN: ATCC 25295 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 749 



GGAACTCCCA 
TTGAAAATGA 
GTGCGTACCA 
TGTGAATAAT 
TGGGTCGTAT 
GTAAATGCAT 
TGAGCTTTCT 
ACTTGTTATG 
GCGGGTGTAG 
CAAGGCACAT 
GTGAAGGTAA 
AAAGTGGCAA 
GCGCGTTGCT 
AAGACGAAAA 
TACCGTTACA 
GCCTTCAGCA 
TGCAAGAGCG 



CGTGACGCTA 
CCTAACCTTA 
TTGCGATGGG 
AC CGGAGCTC 
CATGGATGTA 
CCAATACACG 
TCAACAACGG 
TCCGTTTGCC 
GTAAAACCGT 
AGCGGTTTGT 
TGACTTCTAT 
TGGTTTACGG 
TTGACCGGTT 
CGGTAAAGGT 
CTTTGGCCGG 
GTAGGTTACC 
TATTACCTCT 



TCC CGCATGT 
GAAGTTCAAC 
TAGTTCAGAT 
CGATTACTGT 
TTGGGTAATC 
TGCGATC CAT 
AATTATTAGA 
AAAGGTGGTA 
AAATATGATG 
CTGTGTTTGC 
CACGAGATGA 
TCAGATGAAC 
TGACTATGGC 
CGCGACGTAT 
TACGGAAGTA 
AACCGACATT 
ACCCAGACAG 



TTTTGATGCA 
AACTTTTGGG 
GGATTAAAGC 
TCCTGTTGGC 
CGGTTGATGA 
CAAGAGGCTC 
AACTGGCATT 
AAGTAGGTCT 
GAGTTAATTA 
AGGCGTGGGT 
AAGATTCCAA 
GAACCTCCAG 
CGAATACTTC 
TGTTCTTCGT 
TCCGCATTGC 
GGCTGAAGAA 
GCTCTATTAC 



TTAAAATTAG 
GGATGGTGTA 
GTGGTATGGC 
CGTGAAACTT 
GGCAGGTCCG 
CTAAGTTTGA 
AAGGTTATCG 
GTTTGGTGGT 
ACAACATTGC 
GAACGTACTC 
CGTATTGGAC 
GCAACCGTCT 
CGTGACGAAA 
GGACAACATT 
TGGGTCGTAT 
ATGGGTCGTT 
TTCC 



2) INFORMATION FOR SEQ ID NO: 750 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria flavescens 

(B) STRAIN: ATCC 1312 0 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 750 



CGCGACGCTA 
TCTGACTCTG 
TTGCAATGGG 
ACCGGTGCGC 
TGTCGACGTA 
ACAAGAGCCG 
TCAACTACCG 
TCCGTTTGCT 
GCAAAACCGT 
AGCGGCCTGT 
CGACTTCTAC 



TTCCGCATGT 
GAAGTTCAAC 
TAGTTCAGAC 
CAATCACTGT 
TTGGGTACGC 
TGCCATTCAC 
AATTGTTGGA 
AAGGGC GGTA 
GAACATGATG 
CCGTGTTCGC 
CACGAGATGA 



TTACGATGCC 
AGCTTCTGGG 
GGCCTGAAAC 
GCCGGTAGGT 
CTGTTGATGA 
CAAACTGCTC 
AAC C GGTATT 
AAGTAGGTCT 
GAATTGATCA 
AGGTGTGGGT 
AAGATTCCAA 



CTGAAATTGG 
TGACGGCGTT 
GCGGCATGTC 
AAAGGTACAT 
AGCAGGTCCG 
CGAAATTCGA 
AAAGTGATCG 
GTTCGGTGGT 
ACAACATCGC 
GAACGTACCC 
CGTATTGGAT 



ACGAGAACGG 
GTCCGTACTA 
TGTAAGCAAT 
TGGGC CGTAT 
ATCGATACCG 
TGAGTTGTCT 
ACTTGCTGTG 
GCCGGTGTGG 
CAAAGCGCAC 
GTGAAGGTAA 
AAAGTAGCCA 
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TGGTGTATGG TCAAATGAAC GAACCTCCAG GCAACCGTCT GCGCGTTGCT 600 

TTGACCGGTT TGACTATGGC CGAATACTTC CGTGACGAAA AAGATGAAAA 650 

CGGTAAAGGT CGCGACGTAT TGTTCTTCGT TGACAACATC TACCGTTACA 700 

CTCTGGCCGG TACCGAAGTA TCCGCACTGT TGGGTCGTAT GCCTTCTGCA 750 

GTGGGTTACC AACCGACATT GGCTGAAGAA ATGGGTCGTT TGCAGGAGCG 800 

TATTACTTCT ACGCAAACCG GTTCCATTAC TTCC 834 



10 2) INFORMATION FOR SEQ ID NO: 751 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 4 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria gonorrhoeae 

(B) STRAIN: ATCC 3142 6 



25 



45 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 751 



CGCGACATGA TTCCGCGCGT TTACGACGCT TTGAAATTAG ACGAAAACGG 50 

TCTGACTTTG GAAGTCCAAC AGCTTTTGGG TGATGGCGTA GTCCGTACCA 100 

TCGCTATGGG CAGCTCGGAC GGTTTGAAAC GCGGCATGAC TGTGAGCAAT 150 

ACTGGTTCGC CCATTAC TGT GCCGGTAGGT AAAGGTACGT TGGGACGCAT 2 00 

3 0 TGTCGATGTA TTGGGAACTC CTGTTGACGA GGCAGGTCCA ATTGATACCG 250 

ACAAGAGTCG TGCCATCCAC CAAGCCGCTC CTAAGTTTGA CGAACTGTCT 3 00 

TCCACAACCG AATTGCTCGA AACGGGCATT AAAGTGATTG ACTTGCTGTG 350 

TCCGTTTGCC AAAGGCGGTA AAGTAGGTCT GTTCGGCGGT GCCGGTGTGG • 4 00 

GTAAAACCGT GAACATGATG GAATTGAT C A ACAACATCGC CAAAGCGCAC 450 

3 5 AGCGGCTTGT CCGTGTTCTC AGGCGTGGGT GAGCGTACCC GCGAAGGTAA 500 

CGACTTCTAC CACGAGATGA AAGATTCCAA CGTATTGGAT AAAGTAGC CA 550 

TGGTGTATGG C CAAATGAAC GAACCTCCAG GCAACCGTCT GCGCGTTGCT 600 

TTGACCGGTT TGACTATGGC TGAATACTTC CGTGACGAAA AAGACGAAAA 65 0 

CGGTAAAGGT CGTGACGTAT TGTTCTTCGT TGACAACATC TACCGTTACA 7 00 

4 0 CTCTGGCCGG TACCGAAGTA TCCGCACTGT TGGGCCGTAT GCCTTCTGCA 750 

GTGGGTTACC AACCGACATT GGCTGAAGAA ATGGGTCGTT TGCAAGAGCG 800 

TATTACCTCT ACCCAAACCG GTTCCATTAC TTCC 834 



2) INFORMATION FOR SEQ ID NO: 752 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 82 5 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) . TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria gonorrhoeae 

(B) STRAIN: ATCC 27628 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 752 



561 



CA 02283458 1999-09-28 



AC ATGATTC C GCGCGTTTAC GACGCTTTGA 
ACTTTGGAAG TCCAACAGCT TTTGGGTGAT 
TATGGGCAGC TCGGACGGTT TGAAACGCGG 
5 GTTCGCCCAT TACTGTGCCG GTAGGTAAAG 
GATGTATTGG GAACGCCTGT TGACGAGG CA 
GAGTCGTGCC ATCCACCAAG CCGCTCCTAA 
CAACCGAATT GCTCGAAACG GGCATTAAAG 
TTTGCCAAAG GCGGTAAAGT AGGTCTGTTC 

10 AACCGTGAAC ATGATGGAAT TGATCAACAA 
GCTTGTCCGT GTTCTCAGGC GTAGGTGAGC 
TTCTACCACG AGATGAAAGA TTCCAACGTA 
GTATGGCCAA ATGAACGAAC CTCCAGGCAA 
CCGGTTTGAC TATGGCTGAA TACTTCCGTG 

15 AAAGGTCGTG ACGTATTGTT CTTCGTTGAC 
GGCCGGTACC GAAGTATCCG CACTGTTGGG 
GTTACCAACC GACATTGGCT GAAGAAATGG 
ACCTCTACCC AAACCGGTTC CATTA 



AATTAGACGA AAACGGTCTG 50 

GGCGTAGTCC GTACCATCGC 100 

CATGACTGTG AGCAATACTG 150 

GTACGTTGGG ACGCATTGTC 200 

GGTCCAATTG ATACCGACAA 250 

GTTTGACGAA CTGTCTTCCA 3 00 

TGATTGACTT GCTGTGTCCG 350 

GGCGGTGCCG GTGTGGGTAA 400 

CATCGCCAAA GCGCACAGCG 450 

GTACCCGCGA AGGTAACGAC 500 

TTGGATAAAG TAGCCATGGT 55 0 

CCGTCTGCGC GTTGCTTTGA 60 0 

AC G AAAAAG A CGAAAACGGT 650 

AACATCTACC GTTACACTCT 70 0 

CCGTATGCCT TCTGCAGTGG 750 

GTCGTTTGCA AGAGCGTATT 800 

825 



2) INFORMATION FOR SEQ ID NO: 753 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



35 



55 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria lactamica 

(B) STRAIN: ATCC 23 970 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 753 



GATGCGATTC CGCATGTTTA CGATGCCCTG AAATTGGACG AGAACGGTCT 50 

SccctgSa GTCCAACAGC TTTTGGGTGA CGGCGTTGTC CGTACTATTG 100 

40 CAATGGGTAG TTCAGACGGC CTGAAACGCG GCATGTCTGT CAGCAATACC 150 

S^ScGCCAA TCACTGTGCC GGTAGGTAAA GGTACGTTGG GCCGTATTGT 200 

CGACGTATTG GGT AC AC CTG TTGACGAAGC AGGTCCGATC GATACCGACA 250 

AGAGCCGCGC CATCCACCAA ACCGCCCCGA AATTCGACGA GTTGTCTTCA 3 00 

ACTAC CGAAT TGTTGGAAAC CGGCATTAAA GTGATCGATT TGCTGTGTCC 3 50 

45 GTTTGCTAAG GGCGGTAAAG TAGGTCTGTT CGGTGGTGCC GGTGTGGGCA 400 

AAACCGTGAA CATGATGGAA TTGAT CAACA ACATCGCCAA AGCGCACAGC 4 50 

GGTCTGTCCG TGTTTGCAGG CGTGGGCGAG CGTACCCGCG AAGGTAACGA 500 

CTTCTACCAC GAGATGAAAG ATTCCAACGT ATTGGATAAA GTAGCCATGG 550 

TGTATGGTCA GATGAACGAA CCTCCGGGCA ACCGTCTGCG CGTTGCTTTG 600 

50 ACCGGTTTGA CGATGGCCGA ATACTTCCGC GACGAAAAAG ACGAAAACGG 650 

cSgGCCGC GACGTATTGT TCTTCGTGGA CAACATCTAC CGTTACACCC 700 

TGGCCGGTAC CGAAGTATCC GCACTGTTGG GCCGTATGCC TTCCGCAGTG 75 0 

GGTTACCAAC CGACATTGGC TGAAGAAATG GGTCGTTTGC AAGAGCGTAT 800 

TACCTCTACC CAAACCGGTT CCATTACTTC C 83 1 



2) INFORMATION FOR SEQ ID NO: 754 

60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 83 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria, meningitidis 

10 (B) STRAIN: 2241C 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 754 

CCACGCGACA TGATTCCGCG CGTTTACGAC GCTTTGAAAT TAGACGAAAA 5 0 

15 CGGTCTGACT TTGGAAGTCC AACAGCTTTT GGGCGACGGC GTAGTCCGTA 10 0 

CCATTGCGAT GGGCAGCTCG GACGGTTTGA AACGCGGCAT GACTGTGAGC 15 0 

AATACCGGTG CGCCCATTAC TGTGCCGGTA GGTAAAGGTA CGTTGGGACG 20 0 

CATTGTCGAT GTATTGGGAA CGCCTGTTGA CGAGGCAGGT CCAATCGATA 2 50 

CCGACAAGAG CCGTGCCATC CACCAAGCCG CTCCTAAGTT TGACGAACTG 3 00 

20 ' TCTTCCACAA CCGAATTGCT CGAAACGGGC ATTAAAGTGA TTGACTTGCT 3 50 

GTGTCCGTTT GCCAAAGGCG GTAAAGTAGG TCTGTTCGGC GGTGC CGGTG 4 00 

TGGGTAAAAC CGTGAACATG ATGGAATTGA TCAACAACAT CGCCAAAGCG 4 50 

CACAGCGGCT TGTCCGTGTT CGCAGGCGTG GGTGAGCGTA CCCGCGAAGG 500 

TAACGACTTC TACCACGAGA TGAAAGATTC CAACGTATTG GATAAAGTGG 550 

2 5 CAATGGTTTA CGGTCAGATG AACGAACCTC CGGGCAACCG TTTGCGCGTC 600 

GCATTGACCG GTTTGACCAT GGCGGAATAC TTCCGTGACG AAAAAGACGA 65 0 

AAACGGCAAA GGCCGCGACG TATTGTTCTT CGTGGACAAC ATCTACCGTT 700 

ACACTCTAGC TGGTACCGAA GTATC CGCAT TGTTGGGCCG TATGCCGTCT 75 0 

GCAGTGGGCT ACCAACCGAC ATTGGCAGAA GAAATGGGTC GTTTGCAGGA 8 00 

30 GCGTATTACC TCTACCCAAA CCGGTTCTAT TACTTC 83 6 



2) INFORMATION FOR SEQ ID NO: 755 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 7 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
4 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Neisseria mucosa 

(B) STRAIN: ATCC 19696 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 755 

50 CCGCGTGATG CCATTCCGCG TGTTTACGAC GCCCTGAAAT TGGATGCAAA 50 

CGGCCTGACT TTGGAAGTGC AACAGCTTCT GGGTGACGGC GTAGTTCGTA 100 

CTATTGCAAT GGGTAGTTCG GATGGTTTGA AACGCGGCAT GACTGTAAGC 150 

AATACAGGTG CGC CGATTAC AGTACCGGTA GGTAAAGGTA CTTTGGGACG 2 00 

TATTGTCGAT GTATTGGGTA CGCCTGTTGA CGAAGCAGGT CCGATTGATA 2 50 

55 CCGACAAACA CCGTGCCATC CATCAGACAG CTCCGAAATT TGATGAACTG 3 00 

TCTGCGACTA CTGAGCTGCT GGAAACCGGC ATTAAAGTGA TTGACTTGCT 350 

GTGTCCGTTT GCCAAAGGCG GTAAAGTAGG TCTGTTCGGT GGTGCCGGTG 4 00 

TAGGCAAAAC CGTGAACATG ATGGAATTAA TTAACAACAT CGCCAAAGCA 450 

CATAGCGGTT TGTCCGTGTT TGCAGGTGTG GGTGAGCGTA CCCGTGAAGG 500 

6 0 TAATGACTTC TACCACGAGA TGAAAGATTC CAACGTATTG GACAAAGTGG 550 
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CGATGGTTTA CGGTCAGATG AACGAACCTC 
GCCTTGACCG GTTTGACGAT GGCCGAATAC 
AAGCGGCAAA GGCCGTGACG TATTGTTCTT 
ACACCCTGGC CGGTACCGAA GTATCCGCAT 
5 GCAGTAGGTT ACCAACCGAC ATTGGCTGAA 
GCGTATTACC TCTACCCAAA CAGGCTCCAT 



CGGGTAACCG TCTGCGTGTA 6 00 

TTCCGTGATG AAAAAGACGA 650 

CGTGGACAAT ATCTACCGTT 70 0 

TGTTGGGTCG TATGCCTTCA 750 

GAAATGGGTC GTTTGCAAGA 800 

TACCTCC 837 



10 2) INFORMATION FOR SEQ ID NO: 756 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria subflava 

(B) STRAIN: ATCC 14221 



25 



30 



35 



40 



45 



55 



(xi) SEQUENCE DESCRIPTION: 



ID NO: 756 






CTGAAATTGG 


ACGAGAACGG 


50 


TGACGGCGTT 


GTC CGTACT A 


100 


GCGGCATGTC 


TGTAAGCAAT 


150 


AAAGGTACAT 


TGGGTCGTAT 


2O0 


AGCAGGTCCG 


ATCGATACCG . 


250 


CGAAATTCGA 


CGAGTTGTCT 


300 


AAAGTGATCG 


ACTTG CTGTG 


350 


GTTCGGTGGT 


GCCGGTGTGG 


400 


ACAACATCGC 


CAAAGCGCAC 


450 


GAACGTACCC 


GTGAAGGTAA 


500 


CGTATTGGAT 


AAAGTAGCCA 


550 


GCAACCGTCT 


GCGCGTTGCT 


600 


CGTGACGAAA 


AAGACGAAAA 


650 


TGACAACATC 


TACCGTTACA 


700 


TGGGC CGTAT 


GCCTTCTGCA 


750 


ATGGGTCGTT 


TGCAAGAGCG 


800 


TTCC 




834 



2) INFORMATION FOR SEQ ID NO: 757 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 83 3 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria weaveri 

(B) STRAIN: ATCC 51223 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 757 

564 



CA 02283458 1999-09-28 



CCCGTGATGC TATTCCCATG TATAC GATGC CCTGAAATTG GTAGATAACG 50 

ATCTGACCCT GGAAGTGCAA CAACTTTTAG GTGATGGTGT GGTTCGTACC 100 

ATTGCAATGG GTAGTT CAGA CGGCCTAAAA CGTGGTATGG CTGTTAACAA 150 

5 TACCGGCGCT CCGATTACTG TTC CGGTGGG GAAAGCCACC TTGGGACGTA 2 00 

TT AT GGATGT GTTGGGTAAT CCGGTTGATG AAGCAGGTCC TGTTGTATCA 250 

GAAGAAACTC GCGCTATTCA TCAAGCTGCC CCTAAATTTG ACGAACTGTC 3 00 

TTCAGCAACT GAGTTGTTGG AAACAGGCAT TAAAGTAATT GACTTGCTGT 3 50 

GCC CGTTTGC CAAAGGTGGT AAAGTAGGTT TGTTTGGTGG TGCCGGCGTG 4 00 

10 GGTAAAACCG TAAATATGAT GGAGTTGATC AACAACATCG CGAAGG CAC A 4 50 

TAGTGGTTTG TCTGTATTCG CCGGTGTAGG TGAGCGTACC CGTGAAGGTA 500 

ACGACTTCTA CCATGAAATG AAAGACTCTA ACGTATTGGA TAAAGTAGCC 550 

ATGGTTTATG GCCAGATGAA TGAACCTCCG GGTAACCGTT TGCGCGTTGC 600 

TTTGACTGGT TTGACTATGG CCGAATATTT CCGTGACGAG AAAGATGAAA 650 

15 ACGGCAAAGG TCGTGACGTC TTGTTCTTTG TGGATAATAT CTATCGCTAT 700 

ACTCTGGCCG GTACTGAAGT GTCTGCACTG TTAGGT CGTA TGCCGTCTGC 750 

AGTAGGTTAT CAGCCTACAT TGGCAGAAGA AATGGGTCGC TTGCAGGAGC 800 

GTATTACTTC TACTCAAACA GGTTCGATTA CTT 833 



20 



35 



2) INFORMATION FOR SEQ ID NO: 758 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Neisseria animalis 

(B) STRAIN: ATCC 19573 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 758 



CGCGATGCCA TTCCTCACGT TTACGATGCC CTGAAATTGG ACGACACCGG 50 

TCTGACTTTG GAAGTACAAC AACTT CTGGG CGACGGTGTG GTACGTACCA 100 

4 0 TTGCAATGGG TAGTTCAGAC GGCCTGAAAC GGGGTTTGTC TGTGAGCAAT 150 

ACCGGTTCTC CGATTGCCGT TCCTGTCGGT AAAGCGACTT TGGGTCGTAT 2 00 

TATGGACGTA TTGGGCAATC CGGTTGATGA AGCCGGTCCG GTTGCTACCG 2 50 

AAGAGAAACG TGCTATTCAC CAAGCCGCAC CGAAGTTTGA TGAATTGT CA 3 00 

TCAGCTACCG AGTTGTTGGA AACCGGTATT AAAGTAATCG ACTTGCTGTG 3 50 

4 5 TCCGTTTGCA AAAGGCGGTA AAGTAGGTCT GTTCGGCGGT GC CGGTGTGG 400 

GCAAAACCGT AAACATGATG GAATTGATCA ACAACATCGC CAAAGCACAC 4 50 

AGCGGTCTGT CTGTGTTTGC CGGTGTAGGT GAACGTACCC GCGAAGGTAA 500 

CGACTTCTAC CACGAGATGA AAGATTC CAA CGTGTTGGAT AAAGTAGCCA 550 

TGGTGTACGG TCAGATGAAT GAGCCGCCGG GCAACCGCTT GCGCGTGGCT 600 

50 TTGACCGGCC TGACTATGGC CGAATACTTC GTGACGAAAA AGACGAAAAC 650 

GGCAAAGGTC GTGACGTATT GTTCTTCGTG GACAACATTT ACCGCTACAC 7 00 

ACTGGCCGGT ACCGAAGTAT CAGCATTGTT GGGCCGTATG CCGTCTGCAG 750 

TAGGTTATCA GCCGACATTG GCAGAGGAAA TGGGTCGCTT GCAAGAGCGT 8 00 

ATTACCTCTA CCCAAACCGG TTCGATTACC TCT 83 3 

55 



2) INFORMATION FOR SEQ ID NO: 759 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRAND EDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus pennerl 
10 (B) STRAIN: ATCC 33519 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 759 

AGCGTCCCTA AAGTATACGA CGCTCTTGAG GTTATGAATG GTAAAGAAAA 5 0 

15 ACTGGTGCTG GAAGTTCAGC AACAGTTAGG CGGTGGTATC GTTCGTTGTA 100 

TCGCAATGGG TACATCAGAC GGTTTAAGCC GTGGCTTAAA AGTTGAAGAT 150 

TTAGGCCACC CAATTGAAGT ACCAGTAGGT AAAGCAACAC TGGGACGTAT 2 00 

CATGAACGTT CTGGGTACAC CTATTGATAT GAAAGGTGAT ATTGCAACTG 250 

AAGAACGTTG GTCTATTCAC CGTGAAGCAC CAACCTACGA AGAGTTATCA 3 00 

2 0 AACTCACAAG AACTGCTTGA AACCGGTATC AAAGTAATGG ACTTAATCTG 350 

TCCGTTTGCT AAAGGTGGTA AAGTAGGTCT GTTCGGTGGT GCGGGTGTGG 4 00 

GTAAAACAGT TAACATGATG GAATTGATCC GTAATATCGC GATCGAGCAC 45 0 

T CAGGTT AC T CTGTATTTGC TGGTGTTGGT GAGCGTACTC GTGAGGGTAA 500 

CGACTTCTAT CATGAAATGA CAGATTCTAA CGTTCTTGAC AAAGTATCGT 55 0 

2 5 TAGTTTATGG TCAGATGAAT GAGCCACCAG GAAACCGTCT GCGTGTAGCA 600 

CTGACTGGTC TGAC TATGGC TGAAAAATTC CGTGATGAAG GCCGTGACGT 65 0 

ACTGTTATTC GTCGATAACA TTTATCGTTA CACCTTAGCC GGTACAGAAG 700 

TATCAGCACT GTTAGGTCGT ATGC CAT CAG CGGTAGGTTA CCAGCCAACA 75 0 

TTGGCTGAAG AGATGGGTGT TCTGCAAGAA CGTATCACTT CAACCAAAAC 800 

3 0 AGGTTCAATC ACCTCTGTA 819 



2) INFORMATION FOR SEQ ID NO: 760 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
4 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (a) ORGANISM: Salmonella choleraesuis subsp . choleraesuis 

serotype enteritidis 
(B) STRAIN: ATCC 13076 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 760 

50 GCCGTACCAC G CGTGTACGA TGCCCTTGAG GTGCAGAATG GTAATGAGAA 5 0 

GCTGGTGCTG GAAGTTCAGC AGCAGCTTGG CGGCGGTATC GTGCGTAC CA 100 

TCGCGATGGG GTCTTCTGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAT 15 0 

CTCGAACACC CGAT CGAAGT CCCGGTAGGT AAAGCCACGC TGGGTCGTAT 2 00 

55 CATGAACGTC CTGGGCGAAC CGGTCGACAT GAAAGGCGAG ATCGGCGAAG 2 50 

AAGAGCGT TG GGCGATTCAC CGCGCAGCAC CTTC CTACGA AGAGTTGT CA 300 

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 3 50 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

GTAAAACCGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 4 50 

60 TCCGGTTACT CAGTGTTTGC GGGCGTAGGG GAACGTACTC GTGAGGGTAA 500 
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CGACTTCTAC CACGAAATGA CCGACTCCAA 
TGGTGTATGG CCAGATGAAC GAGCCGCCGG 
CTGACCGGCC TGACCATGGC GGAGAAATTC 
ACTGCTGTTC GTCGATAACA TCTATCGTTA 
5 TATCCGCACT GCTGGGCCGT ATGCCTTCCG 
CTGGCGGAAG AGATGGGCGT TCTGCAGGAA 
CGGTTCTATC ACCTCCGTA 



CGTTATCGAT AAAGTATCCC 550 

GAAACCGTCT GCGCGTTGCA 600 

CGTGACGAAG GTCGTGACGT 65 0 

CACCCTGGCC GGTACGGAAG 70 0 

CAGTAGGTTA CCAGCCGACT 750 

CGTATCACCT CCACCAAAAC 800 

819 



10 



2) INFORMATION FOR SEQ ID NO: 761 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 812 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia pestis 

(B) STRAIN: RRBK1M27 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 761 

AC CAAAAGTG TACAACGCCC TTGAGGTAGA AGGTACAACT GAAAAGTTAG 50 

TGCTGGAAGT TCAGCAACAG TTGGGCGGTG GTGTTGTTCG TTGTATCGCA 100 

ATGGGCTCTT CCGATGGTCT GAGCCGTGGG TTGAAAGTAA CCAACCTAGA 150 

3 0 ACACCCGATC GAAGTAC CGG TTGGTAAAGC GACCCTTGGC CGTATCATGA 2 00 

ACGTATTGGG TGAACCAATC GACATGAAAG GTCCTATCGG TGAAGAAGAG 2 50 

CGTTGGGCAA TCCATCGCGA AGCGCCTTCT TATGAAGAGC TTGCTAGCTC 3 00 

ACAAGATCTG TTAGAAAC CG GTATCAAGGT TATGGACCTG ATTTGTCCGT 3 50 

TTGCTAAGGG CGGTAAAGTC GGTCTGTTCG GTGGTGCGGG TGTAGGTAAA 4 00 

3 5 ACAGTAAACA TGATGGAGCT GATCCGTAAC ATCGCGATCG AGCACTCTGG 450 

GTATTCTGTA TTTGCCGGTG TAGGTGAGCG TACCCGTGAG GGTAATGACT 500 

TCTACCATGA AATGACTGAC TCCAACGTTT TGGACAAAGT ATCCTTGGTT 550 

TACGGCCAGA TGAATGAGCC ACCAGGTAAC CGTCTACGCG TTGCACTGAC 600 

CGGCCTGACC ATGGCGGAGA AATTCCGTGA TGAAGGTCGT GACGTACTGC 650 

4 0 TGTTTATCGA TAATATCTAT CGTTATACCC TAGCTGGTAC GGAAGTATCC 700 

GCATTGCTGG GTCGTATGCC ATCAGCGGTA GGTTATCAGC CAACACTGGC 750 
TGAAGAGATG GGTGTGTTGC AGGAACGTAT TACTTCCACT AAGACGGGTT 8 00 

CAATCACCTC TG 812 

45 

2) INFORMATION FOR SEQ ID NO: 762 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 408 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Burkholderia mallei 

(B) STRAIN: GB8 

60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 762 

CGACGTGATC GAGCCCTTCT TCGTCGACGT GATGCGCTCT TGCAGCTTGC 

CCATTTCTTC AGCCAGCGTC GGCTGATAGC CCACTGCCGA CGGCATACGG 

CCGAGCAGCG CCGACACTTC GGTACCGGCC AGCGTGAAAC GGTAGATGTT 

GTCGACGAAG AACAGCACGT CGAGGCCTTC GTCACGGAAG TGCTCGGCCA 

TCGTGAGGCC CGTCAGCGCC ACGCGCAGAC GGTTGCCCGG CGGCTCGTTC 

ATCTGGCCGT ACACCAGCGC GACCTTGTCG AGAACGTTCG AGTCCTTCAT 

TTCGTGGTAG AAGTCGTTCC CTTCACGGGT ACGCTCGCCC ACGCCCGCGA 
AC AC GGAGT A ACCGCCGTGC TCCTTCGCGA TGTTGTTGAT GAGCTCCATC 
ATGTTGAC 



2) INFORMATION FOR SEQ ID NO: 763 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium sordellix 

(B) STRAIN: ATCC 9714 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 763 

GAACTTATAA ACAACATAGC TACTCAACAT GGTGGTATAT CAGTATTCGC 
AGGTGTTGGA GAGAGAACAA GAGAAGGTAA CGACCTTTAC GGAGAAATGA 
GTGAGTCTGG AGTTATAAAC AAGACAGCTC TAGTATTCGG ACAAATGAAT 
GAGCCACCTG GAGCAAGAAT GAGAGTTGCT TTAACTGGTC TTACAATGGC 
TGAATATTTC AGAGATCAAG AAGGACAAGA CGTTTTATTA TTCGTAGATA 
ATATATTCCG TTTCACTCAA GCAGGATCTG AGGTTTCTGC ACTTCTTGGA 
CGTACTCCAT CAGCAGTTGG ATAC CAAC CA ACATTAGCTA CAGAGATGGG 
TAGATTACAA GAGAGAATAA C ATC T AC AAA TAAAGGGTCT ATAACATCAG 



2) INFORMATION FOR SEQ ID NO: 764 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium novyi 

(B) STRAIN: ATCC 19402 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 764 

TTAATTCAAG AATTAAT CAA CAATATAGCG AAGGAACACG GTGGATTATC 
TGTATTTACA GGAGTTGGAG AAAGAACAAG AGAAGGTAAT GACCTTTACT 
ATGAAATGAA AGAATCTGGT GTTATAAATA AAACAGCACT AGTATTTGGT 



568 



CA 02283458 1999-09-28 



CAAATGAATG AGCCACCTGG AGCAAGAATG 
TACTATGGCA GAATATTTCA GGGATCAAGG 
TAGACAACAT ATTTAGATTC ACTCAAGCAG 
CTTGGAAGAA TACCTAGTGC CGTTGGATAC 
5 AATGGGTGCG CTTCAAGAAA GAATAACATC 
CATCA 



AGAGTTGCTC TTACAGGACT 2 00 

ACAAAACGTA CTTTTATTCA 250 
GTTCAGAGGT GTCAGCTTTA 3 00 

CAACCAACAC ' TAGCAACAGA 350 
TACAAAGCAT GGTTCTATAA 4 00 

405 



10 2) INFORMATION FOR SEQ ID NO: 765 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 93 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium botulinum 
(B> STRAIN: 20:3.1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 765 



25 



AATTAATAAA TAACATAGCT AAAGAACATG GTGGATTATC AGTGTTTACT 50 

GGAGTTGGAG AAAGATC TAG AGAAGGTAAT GATCTTTATC ATGAAATGAG 100 

AGAATCAGGC GTTATAGATA AAACCGCATT AGTATTTGGT CAAATGAATG 150 

AGCCACCTGG TGCTAGAATG AGAGTTGCAT TAACAGGGTT AACTATGGCT 2 00 

3 0 GAATATTTTA GAGATAAAGG TCAAGATGTA TTACTATTTA TAGATAATAT 250 

ATTTAGATTT ACTCAAGCTG GATCAGAAGT TTCAGCATTA CTTGGAAGAA 3 00 

TACCATCAGC AGTTGGTTAC CAACCAACTT TAGCAACTGA AATGGGTGCA 350 

CTTCAAGAGA GAATTACATC AACTAAGAAT GGTTCTATAA CAT 3 93 



35 



2) INFORMATION FOR SEQ ID NO: 766 



(i) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH: 3 99 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium histolyticum 

(B) STRAIN: ATCC 194 01 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 766 



50 



AGGAACTTAT AAATAATATT GCAAAACAAT ATGGAGGTCT AT CTGTATTT 50 

ACAGGTGTTG GTGAAAGAAC AAGAGAAGGT AATGACCTAT ATAATGAAAT 100 

55 GAAAGATTCT GGGGTTATAG AGAAAACTGC ACTAGTATTT GGACAGATGA 150 

ATGAACCACC AGGAGCGAGA ATGAGAGTAG CATTGACAGG ACTTACTATG 2 00 

GCAGAATATT TTAGAGATCA AGGGCAAGAT GTACTTTTAT TTATAGATAA 2 50 

TATATTTAGA TTTACGCAGG CTGGTTCTGA AGTTTCTGCA TTGTTAGGAA 300 

GAATTCCAAG TGCAGTTGGA TATCAAC CAA CCCTTGCAAC TGAAATGGGT 350 

6 0 GCATTACAAG AAAGAATAAC ATCCACAAAA AATGGATCAA TTACTTCAG 3 99 
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2) INFORMATION FOR SEQ ID NO: 767 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 90 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNES S : Doubl e 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Peptostreptococcus prevotii 

(B) STRAIN: ATCC 9321 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 67 

2 0 CATGATGGAA CTGATCCGTA ACATCGC CAT CGAGCACAGC GGTTATTCCG 5 0 

TGTTCGCCGG TGTGGGTGAG CGTACTCGTG AGGGTAACGA CTTCTACCAC 100 

GAGATGAAGG ACTCCAACGT TC TGGAC AAA GTGGCACTGG TCTACGGTCA 150 

GATGAACGAG CCGCCGGGTA ACCGTCTGCG CGTAGCACTG AC TGGCCTGA 200 

CCATGGCCGA GAAGTTCCGT GACGAAGGTA ACGACGTTCT GTTGTTCGTC 250 

25 GACAACATCT AC C GTTACAC CCTGGCCGGT ACTGAAGTAT CCGCACTGCT 3 00 

GGGCCGTATG CCTTCGGCAG T AGGTTAC C A GCCGACCCTG GCTGAAGAGA 350 

TGGGCGTTCT GCAAGAACGT ATCACTTCGA CCAAGGAAGG 390 



30 



40 



2) INFORMATION FOR SEQ ID NO: 768 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 953 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Absidia corymbifera 

(B) STRAIN: ATCC 46775 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 768 

AGGTCTTGTT CGTGGCAAAA GGTCATTGAC ACTGGTGCTC CTATCACCAT 50 

TCCTGTTGGT AACGAAGTCC TTGGTCGTAT CATTAACGTC ATTGGTGAGC 100 

CCATTGATGA GCGTGGTCCY ATCAAGTCCA AGGCCACTCG TGCTATCCAC 150 

50 GCTGATGCTC CCGAGTTCGT TGATCAATCC CCCACTCCCG AGATTCTCGA 200 

GACTGGTATC AAGGTTGTCG ATTTGCTTGC TCCCTATGCT CGTGGTGGTA 250 

AGATTGGTCT TTTCGGTGGT GCCGGTGTCG GCAAGACTGT CTTGATTCAA 3 00 

GAATTGAT CA ACAACATTGC CAAGGCTCAC GGTGGTTACT CTATCTTCTG 3 50 

TGGTGTCGGT GAACGTACTC GTGAAGGCAA CGATTTGTAC CACGAAATGA 4 00 

55 TTTCCACTGG TGTCATCAAG CTTGAAGGTG AATCCAAGTG TGCTCTTGTC 450 

TTTGGTCAAA TGAACGAACC CCCCGGAGCT CGTGCCCGTG TTGCCTTGAC 5 00 

TGGTTTGACC ATTGCCGAAT ACTTCCGTGA TGAGGAAGGT CAAGATGTGT 550 

TGCTCTTCAT TGACAACATT TTCCGTTTCA CTCAAGCCGG TTCTGAAGTG 6 00 

TCCGCTTTGC TTGGTCGTAT TCCCTCTGCT GTCGGTTACC AACCCACTCT 650 

60 YTCCACTGAT ATGGGTGGTA TGCAAGAGCG TATTACTACC AC C AAGAACG 7 00 
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GTTCCATTAC CTCCGTGCAA GCTGTCTACG 
GATCCTGCTC CTGCTACTAC TTTTGCTCAC 
GTCTCGTTCC ATTGCTGAGT TGGGTATCTA 
ACTCCAAGTC CCGTATCTTG GATCCCCGTA 
5 TCTGTTGCCA CTGGTGTCCA ACAAATCCTT 
AGA 



TCCCTGCTGA CGATTTGACT 75 0 

TTGGACGCCA CCACTGTGTT 80 0 

CCCTGCTGTC GATCCCCTTG 85 0 

TCGTTGGTGA TGAGCACTAC 900 
CAAAACTACA AGTCGCTTCA 950 

953 



10 2) INFORMATION FOR SEQ ID NO: 769 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1343 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alternaria alternata 

(B) STRAIN: ATCC 620 99 



25 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 769 



CGCGGTTCCA AGGC CACCGA CACCGGTGCC CCCATCAAGA TTCCCGTTGG 5 0 

TCACGGTACC CTTGGTCGTA TCATGAACGT CACTGGTGAC CCCATTGACG 10 0 

AGCGTGGTCC CATCAAGGCC AC CAAGTACG CTCCCATCCA CGCCGACCCC 15 0 

CCGGAGTTCA CCGAGCAATC CACCTCCGCT GAGGTCCTCG TTACCGGTAT 200 

3 0 CAAGGTTGTC GACCTGTTGG CTCCTTACGC TCGTGGTGGA AAGATTGGTC 250 

TCTTCGGAGG TGCTGGTGTC GGAAAGACTG TCTTCATTCA GGAGC TGATT 3 00 

GTAAGGAGAC ACACTGTCTA CTGGCTGAGC ATTAGCTAAC GGCAGGCAGA 3 50 

ACAACATCGC CAAGGCCCAC GGTGGTTTCT CTGTCTTCAC TGGTGTCGGT 4 00 

GAGCGTACCC GTGAGGGTAA CGATCTGTAC CACGAGATGC AGGAGACTTC 4 50 

3 5 CGTCATTCAG CTTGACGGTG ACTC CAAGGT CGCCCTCGTC TTCGGTCAGA 5 00 

TGAACGAGCC CCCGGGTGCC CGTGCCCGTG TCGCTCTTAC TGGTCTTACT 550 

GTTGCTGAGT AAGTCTTGAA TTCACTGTGT TGACAACGTC GTGGCTAATG 60 0 

GGAAAAAAGA TACTTCCGTG ACGAAGAGGG ACAGGATGTG CTTCTCTTCA 65 0 

TCGACAGTAA GTGCTTGTAC GAACTGCCTG TGAGACATAC ACTGACTTCG 70 0 

4 0 GCAATAGACA TTTTCCGTTT CACCCAGGCC GGTTCCGAGG TGTCCGCTCT 75 0 

TCTTGGTCGT ATCCCCTCTG CCGTCGGTTA CCAGCCCACC CTCGCCATTG 80 0 

ACATGGGTGT CATGCAGGAG CGTATTACCA CCACCACCAA GGGTTCCATC 85 0 

ACCTCCGTCC AGGCCGTCTA CGTGCCCGCT GACGATTTGA CTGACCCTGC 90 0 

CCCCGCCACC ACCTTCGCCC ATTTGGACGC CACCACTGTC TTGTCCCGTG 950 

45 GTATCTCCGA GTTGGGTATC TACCCCGCCG TCGACCCTCT TGACTCCAAG 10 00 

TCCCGTATGT TGGACCCCCG TGTCATTGGT CAGGACCACT ACGACACCGC 1050 

CACCCGCGTT CAGCAGATTC TCCAGGAGTA CAAGTCGCTC CAGGATATCA 1100 

TTGCCATTCT CGGTATGGAC GAGTTGTCGG AAGCTGACAA GCTTACCGTC 115 0 

GAGCGTGCCC GTAAGATCCA GCGTTTCTTG AGCCAGCCTT TCGCTGTCGC 12 0 0 

50 CCAGGTCTTC ACTGGTATTG AGGGCAAGCT TGTCGACCTC AAGGACACCA 12 50 

TCCGATCATT CAAGGC TATC TTGACTGGTG AGGGTGACGA CCTTCCCGAG 13 00 

GGTGAGTCTC GACTATCTCC GCATTCATAG CGTATAACTG ACA 13 4 3 



55 

2 ) INFORMATION FOR SEQ ID NO: 770 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 480 bases 
60 (B) TYPE: Nucleic acid 



571 
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10 



(C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus flaws 

(B) STRAIN: ATCC 26947 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 770 



TTCAGGAGTT 
ACTCTTTTCT 
ACTCTGTCTT 

15 TACCACGAAA 
GGTCGCCCTT 
GTGTCGCCCT 
GGTCAGGATG 
CGGTTCTGAG 

2 0 ACCAGCCCAC 
ACCACCACCA 



GATTGTATGT 
AACTCTTCAC 
CACTGGTGTC 
TGCAGGAGAC 
GTCTTCGGTC 
TACCGGTCTG 
TGCTGCTCTT 
GTGTCTGCCC 
TCTGGCCGTC 
AGGGTTCCAT 



TCACCTGCAA 
AGAACAACAT 
GGTGAGCGTA 
TGGTGTCATT 
AGATGAACGA 
ACCATCGCCG 
CATTGACAAC 
TTCTTGGTCG 
GACATGGGTG 
TACCTCCGTC 



CATAAGACTT 
TGCCAAGGCT 
CCCGTGAGGG 
CAGCTCGAGG 
GCCCCCAGGT 
AGTACTTCCG 
ATTTTCCGTT 
TATCCCCTCC 
GTATGCAGGA 



CCCATTCTCC 5 0 

CACGGTGGTT 100 

TAACGATCTG 150 

GTGAATCTAA 2 0 0 

GCCCGTGCCC 250 

TGACGAGGAA 3 00 

TCACCCAGGC 350 

GCTGTCGGTT 4 00 

GCGTATTACC 45 0 
480 



25 2) INFORMATION FOR SEQ ID NO: 771 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1174 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



30 



35 



40 



45 



50 



55 



60 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mucor clrcinelloides 

(B) STRAIN: ATCC 3 8592 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 771 



CTC GAACAAG AYAACTTGCC YGCCATTTTG 
YCACTCTGGT GGACGTCTCG TYCTCGAAGT 
ACACTGTCCG TACTATTGCT ATGGATGGTA 
TCCCANNGGA TACAGTCARA CAGMAATGTC 
GATGATTGAC CAATATGTTA GGTCTTGTCC 
CACTGGTGCT CCCATTACCA TCCCCGTTGG 
TCATCAACGT TATTGGTGAA CCCATTGATG 
AAGACTCACC GTCCTATTCA CGCTGAAGCT 
CCCCACTCCC GAAATCCTCG AGACTGGTAT 
CTCCTTATGC TCGTGGTGGT AAGATTGGTC 
GGTAAGACTG TCTTGATTCA AGAATTGATT 
CGGTGGTTAC TCTATCTTCT GTGGTGTCGG 
ACGATTTGTA CCATGAAATG ATTGAAACCG 
GAGTC CAAGT GTGCTCTCGT CTTTGGTCAA 
TCGTGCCCGT GTCGCTTTGA CTGGTTTGAC 
ATGATGAGGG TCAAGATGTC TTGCTTTTCA 
ACTCAAGCTG GTTCTGAGGT ATCTGCCCTT 
TGTCGGTTAC CAACCCACTC TTTCCACYGA 
GTATTACTAC CACCAAGAAC GGTT CCATTA 
GTCCCTGCTG ATGAT TTGAC CGATCCTGCT 



AACGCCCTTG 
CTCTCAACAT 
CTGAAGGTAA 
TAGTGGTTAT 
GGTGGTCAAA 
TAAGGAAGTC 
AACGTGGTCC 
CCCGAATTCG 
CAAGGTYGTC 
TCTTCGGTGG 
AACAACATYG 
TGAACGTACT 
GTGTCATTCA 
ATGAACGAAC 
TATTGCTGAA 
TTGATAACAT 
TTGGGTCGTA 
TATGGGTGGT 
CCTCCGTCCA 
CCTGCCACCA 



AAGTCAAGGA 50 

TTGGGTGAGA 10 0 

GTTATGTYCA 150 

AGCAGYAGCA 200 

AGGTTGTTGA 250 

CTTGGTCGTA 30 0 

CATTGACGCC 3 50 

TTGAT CAATC 400 

GATTTGTTGG 4 5 0 

TGCTGGTGTC 500 

CCAAGGCT CA 550 

CGTGAGGGTA 600 

ATTGGAAGGC 650 

CCCCAGGTGC 70 0 

TACTTCCGTG 750 

TTTCAGATTC 800 

TTCCTTCCGC 8 50 

ATGCAAGAGC 90 0 

AGCTGTCTAC 950 

CTTTTGCTCA 10 0 0 



572 
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CTTGGATGCC ACCACTGTCT TGTCTCGTTC CATCGCTGAA TTGGGTATCT 105 0 

AYCCCGCTGT CGATCCTCTT GATTCCAAGT CTCGTATCCT CGATCCCCGT 1100 

ATTGTCGGTG ATGAGCACTA CAAGGTTGCC ACTGAAGTTC AACAAATCCT 1150 

CCAAAACTAC AAGTCTCTCC AAGA 1174 



2) INFORMATION FOR SEQ ID NO: 772 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 67 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Piedrala hortai 
2 0 (B) STRAIN: ATCC 242 92 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 772 

AGGAGCTTAT CAACAACATC GCCAAGGCTC ACGGTGGTTA CTCCGTCTTC 50 

2 5 ACTGGTGTCG GTGAGCGTAC TCGTGAGGGT AACGATTTGT ACCACGAGAT 100 

GCAAGAGACT TCCGTCATTC AGCTTGACSG CGAGTCCAAG GTCGCTCTCG 150 

TGTTCGGTCA GATGAACGAG CCCCCGGGTG CCCGTGCCCG TGTTGCCCTG 2 00 

ACTGGTCTTA CCATCGCTGA GTACTTCCGT GATGCCGAGG GTCAGGATGG 2 50 

TAAGTTCTAT AACTCTTGTC GCAAAGGTTT CATTCTGGTC GCTAACTTGC 3 00 

3 0 TCAGTGCTCC TGTTCATCGA CAACATTTTC CGTTTCACCC AGGCTGGTAT 3 50 

GGAGGTGTCT GCCCTCCTCG GTCGTATTCC TTCTGCCGTC GGTTACCAAC 4 00 

CTACTCTCGC CGTCGACATG GGTGGTATGC AAGAGCGTAT TACCACTACC 4 50 

AAGAAGGGAT CCATTAC 4 67 



35 



2) INFORMATION FOR SEQ ID NO: 773 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 578 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Pseudallescheria boydli 

(B) STRAIN: ATCC 44331 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 773 



50 



TTGCCAAGGC CCACGGTGGT TACTCTGTCT TCACTGGTGT CGGTGAGCGT 50 

ACCCGTGAGG GTAACGATCT GTAC CACGAA ATGCAGGAAA CCTCCGTCAT 100 

55' TCAGCTTGAT GGCGAGTCCA AGGTCGCGCT TGTCTTCGGT CAGATGAACG 150 

AGCCCCCTGG TGCCCGTGCT CGTGTCGCTC TTACTGGTCT TACCGTTGCC 2 00 

GAGTACTTCC GTGATGAGGA GGGT CAGGAT GGTAAGTTAT ATCGTTTTTA 2 50 

TTATCTTCTT TGCCACCACC CCTCTACGAA TCCATGCCTC CGTTGGTGAA 3 00 

GGCATCGTTT GTAGGGCGGG TCGGAGTTTG CGGCAATTTC TGCCGTCGGC 3 50 

60 TTGAAGCCGC GGATGCCCGA TGTTTGACGC GTATCGATGC TAACAACAAT 4 00 

573 
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GACAACAGTG CTTCTCTTCA TTGACAACAT 
GTTCCGAGGT GTCTGCCCTT CTCGGTCGTA 
CAGCCCACTC TTGCCGTAGA CATGGGTGCC 
CACCAAGAAG GGTTCGATTA CCTCCGTC 



TTTCCGATTC ACCCAGGCCG 4 50 

TTCCCTCTGC CGTCGGTTAC 50 0 

ATGCAGGAAC GT ATT AC C AC 550 

578 



2) INFORMATION FOR SEQ ID NO: 774 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1123 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhizopus oryzae 
20 (B) STRAIN: ATCC 56015 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 774 

AACTTACCYG CTATCTTGAA CGCTCTCGAA GTC CAAGATC ACTCTGGTGG 5 0 

25 ACGTCTTGTC CTTGAAGTTC GCTCAACACT TGGGTGAAAA TACTGTCCGT 100 

ACTATTGCTA TGGATGGTAC TGAAGGTAAG CTATACTATA ACCGTKTTAT 150 

CCGAGTATGA TATTAACTTG AAAAAAGGTC TCGTYCGTGG TCAAAAGGTT 200 

ATTGACACTG GTGCTCCCAT TACCATTCCT GYTGGTAAGG AAGTTCTCGG 250 

TCGTATCATT AACGTCATTG GTGAACC CAT CGATGAACGT GGTCCTATCA 3 00 

3 0 ACGCCAAGAG CCAACGTCCC ATTCACGCCG AAGCTCCCGA ATTCGTTGAC 3 50 

CAATCTCCTA CTCCCGAAAT TCTTGAAACT GGTATCAAGG TTGTCGACTT 4 00 

GTTGGCTCCT TATGCTCGTG GTGGTAAGAT TGGTCTTTTC GGTGGTGCTG 450 

GTGTCGGTAA GACTGTGTTG ATTCAAGAAT TGATTAACAA CATCGCCAAG 50 0 

GCTCACGGTG GTTACTCTAT TTTCTGTGGT GT CGGTGAAC GTACTCGTGA 550 

35 AGGTAACGAT CTTTACCACG AAATGATTGA AACTGGTGTC ATCAAGCTCG 600 

ATGGTGACTC CAAGTGTGCT CTTGT CTTTG GTCAAATGAA CGAACCCCCA 650 

GGAGCTCGTG CCCGTGTTGC CTTGACTGGT TTGACCATTG CTGAATACTT 700 

CCGTGATGAT GAAGGTCAAG ATGTGTTGCT TTTCATTGAT AACATTTTCC 750 

GTTTCACCCA AGCTGGTTCW GAAGTATCTG CCCTTTTGGG TCGTATTCCC 8 00 

4 0 TCCGCTGTCG GTTACCAACC CACTCTTTCT ACTGATATGG GTGGTATGCA 850 

AGAAC GTATT ACAACCACCA AGAACGGTTC CATTACATCC GTCCAAGCTG 90 0 

TCTACGTCCC TGC TGATGAT TTGACCGATC CTGCTCCCGC CACCACTTTT 950 

GCTCACTTGG ATGCCACCAC TGTCTTGTCT CGTTCCATTG CCGAATTGGG 100 0 

TATTTACCCT GCCGTCGATC CTCTTGAYTC CAAGTCTCGT ATCTTGGATC 1050 

45 CTCGTATCGT TGGTGACGAA CACTACAAGG TCGCTACCGA AGTT CAACAA 1100 

ATCCTTCAAA ACTACAAGTC TCT 1123 



50 2) INFORMATION FOR SEQ ID NO: 775 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

574 
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(A) ORGANISM: Scopulariopsis koningii 

(B) STRAIN: ATCC 38745 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 775 

5 

ATTCAGGAGC TCATCAACAA CATTGCCAAG GCTCACGGTG GTTACTCTGT 50 

GTTCACTGGT GTCGGTGAGC GTACCCGTGA GGGTAACGAT CTGTACCACG 10 0 

AAATGCAGGA GACTTCGGTC ATTCAGCTCG AGGGCGAGTC CAAGGTCGCG 150 

CTTGTGTTCG GTCAGATGAA CGAGCCCCCC GGTGCCCGTG CCCGTGTCGC 2 00 

10 CCTTACCGGT CTGACCGTTG CCGAGTACTT CCGTGACGAG GAGGGC CAGG 250 

ATGGTGAGTA ACCGACGAAG TCTGAGATCT TGTCGGGCAT TATTCTAACG 3 00 

ACAACTAGTG CTTCTCTTCA TCGACAACAT TTTCCGCTTC ACCCAGGCCG 3 50 

GTTCCGAGGT GTCCGCGCTT CTCGGCCGTA TCCCCTCTGC CGTCGGTTAC 400 

CAGCCCACCC TGGCCGTCGA CATGGGAGGT ATGCAGGAGC GTATTACCAC 4 50 

15 GACTCAGAAG GGCTCGATTA CCTCGGT 4 77 



2) INFORMATION FOR SEQ ID NO: 776 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 610 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
2 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: ATCC 8125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 776 

35 TCCGGAGTTG ATTGTAAGTC ATTTGAAACC CAGCCCCAAG AAACAGAAGC 50 

TAGGCGAAAA TTGGACAATT GAGCAATTTA GCCATTGGAG AAAAGAAATT 10 0 

TCGAGTATTA ATTGTTTTTA TAGAACAACA TTGCCAAGGC * TCACGGTGGT 150 

TACTCTGTCT TCACTGGTGT CGGAGAGCGT ACCCGTGAAG GAAACGATCT 200 

CTACCATGAA ATG CAGGAGA CCCGTGTCAT TCAGCTTGAT GGCGAGTCCA 2 50 

4 0 AGGTCGCCCT GGTCTTCGGC CAGATGAACG AGCCCCCAGG TGCCCGTGCC 3 00 

CGTGTTGCTC TTACTGGTTT GACCATTGCT GAGTACTTCC GTGATGAGGA 350 

AGGTCAAGAC GGTGAGTTTY TTATGGATAA AARATTTTTT TTTTTTTTTT 4 00 

TTTTTTMAAR AAATTCATGT TCTAACAAAG TATATCCTAG TGCTTCTCTT 4 50 

CATCGACAAC ATTTTCCGTT TCACTCAGGC TGGTTCCGAA GTGTCTGCCC 500 

45 TGCTTGGTCG TAT C CCATCT GCCGTCGGTT ACCAACCCAC TCTTGCCGTC 550 

GACATGGGTG GTATGCAGGA ACGTATTACC ACCACCAAGA AGGGATCCAT 60 0 

TACCTCCGTC 610 



50 

2 ) INFORMATION FOR SEQ ID NO: 777 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 593 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

60 

575 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophytc 

(B) STRAIN: ATCC 56185 

5 (xi) SEQUENCE DESCRIPTION: SEQ 

GGAGTTGATT GTAAGT CATT TGAAACCCAG 
GTGAAAATTG GACAATTGAA CAATTTAGCC 
AGTATTAATT ATTTTTATAG AACAACATTG 

10 TCTGTCTTCA CTGGTGTCGG AGAGCGTACC 
CCATGAAATG CAGGAGACCC GTGTCATTCA 
TCGCCCTGGT CTTCGGCCAG ATGAACGAGC 
GTTGCTCTTA CTGGTTTGAC CATTGCTGAG 
TCAAGACGGT GAGTTTCTTA TGGATGAAAG 

15 TCATGTTCTA ACAAAGTGTA TTCTAGTGCT 
TCCGTTTCAC TCAGGCTGGT TCCGAAGTGT 
CCATCTGCCG T CGGTTAC C A ACCCACTCTT 
GCAGGAACGT ATTACCACCA CCAAGAAGGG 



tonsurans 



ID NO: 777 

CCCCAAGAAA CAGAAGCTAG 50 

CTTGGAGAAA AGAAATTTCG 100 

CCAAGGCTCA CGGTGGTTAC 150 

CGTGAAGGAA ACGATCTCTA 2 00 

GCTTGATGGC GAGTCCAAGG 250 

CCCCAGGTGC CCGTGCCCGT 3 00 

TACTTCCGTG ATGAGGAAGG 3 50 

ATTTTTTTTT TTCAAGAAAT 4 00 

TCTCTTCATC GACAACATTT 450 

CTGCCCTGCT TGGTCGTATC 500 

GCCGTCGACA TGGGTGGTAT 550 

ATCC ATT ACC TCC 593 



20 



25 



30 



35 



2) INFORMATION FOR SEQ ID NO: 778 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1141 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichosporon cutaneum 

(B) STRAIN: ATCC 62965 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 778 



CCGTGGTCAA 
GTCGTGGTAC 

40 GAACGTGGCC 
ACCAACCTTT 
TCAAGGTTGT 
CTTTTCGGTG 
TAACAACATT 

4 5 GTGAAAGAAC 
GGTGTCATCA 
AATGAACGAA 
CCATTGCCGA 
GTTGACAACA 

50 TTTGGGTCGT 
ATATGGGTGC 
ACATCTGTCC 
CCCAGCCACC 
CCATTTCCGA 

55 TCTCGTCTTT 
CACTCAAGTT 
TTGCCATTTT 
GAACGTGCCC 
CGAAGTTTTC 

60 TCAGATCTTT 



GAAGTTATTG 

TCTTGGTAGA 

CTATCAAGGC 

GCTGAACAAT 

CGATCTTCTT 

GTGCTGGTGT 

GCCAAGGCTC 

CCGTGAAGGT 

ACCTCGAAGG 

CCTCCAGGTG 

ATACTTC CGT 

TTTTCAGATT 

ATTCCATCTG 

CCTCCAAGAA 

AAGCCGTCTA 

ACTTTCGCTC 

ATTGGGTATC 

TGGATCCTGA 

CAACAAACCC 

GGGTATGGAT 

GTAAGATCCA 

ACTGGTATCG 

CAAGGAAATC 



ACACTGGTGC 

ATTATCAACG 

TTCTAAGTAT 

CTACTTCTGC 

GCTCCTTACG 

CGGTAAGACT 

ACGGTGGTTT 

AACGATCTTT 

TGACTCCAAG 

CCCGTGCCCG 

GATGAAGAAG 

CACCCAAGCC 

CCGTCGGTTA 

CGTATTACCA 

TGTCCCAGCA 

ACTTGGACGC 

TACCCAGCTG 

AGTTATTGGA 

TCCAAGCTTA 

GAATTGTCTG 

AAGATTCCTT 

AAGGTCGTCT 

CTTGAAGGTA 



CCCAATTACC 

TCATTGGTGA 

GCTCCTATCC 

TGAAGTTCTT 

CCCGTGGTGG 

GTCTTCATTC 

CTCTGTCTTC 

ACCGTGAAAT 

GTCGCTCTCG 

TGTCGCTTTG 

GACAAGATGT 

GGTTCTGAAG 

CCAACCTACC 

CCACCCAAAA 

GACGATTTGA 

CACCACTGTC 

TCGATCCTCT 

CACGAACACT 

CAAGTCTCTC 

AAGCTGATAA 

TCCCAACCAT 

CGTTCCATTG 

AGTACGATCA 



ATTCCTGTTG 50 

ACCAATTGAC 100 

ATACTGAACC 150 

GAAACCGGTA 200 

TAAGATTGGT 250 

AAGAACTTAT 300 

ACTGGTGTCG 350 

GAAGGAAACT 4 00 

TTTTCGGTCA 45 0 

ACTGGTCTTA 500 

CTTGCTTTTC 550 

TCTCTGCTCT 6 00 

CTTGCTACCG 6 50 

GGGTTCCGTC 700 

CTGATCCTGC 750 

TTGTCTCGTT 800 

CGATTCCAAG 850 

ACGAAGTTGC 900 

CAAGAT AT C A 950 

GCTTACTGTC 1000 

TCGCTGTTGC 1050 

AAGGAAACCG 1100 

C 1141 
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2) INFORMATION FOR SEQ ID NO: 779 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1093 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Cladophialophora carrionii 

(B) STRAIN: ATCC 22864 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 779 

2 0 GCTGAAGGCC GAACGTGAGC GTGGTATCAC CAT CGAT ATC GCGCTCTGGA 50 

AGTTCGAGAC TCCCAAGTAC TTCGTCACCG TCATCGATGC CCCTGGTCAT 100 

CGTGACTTCA TCAAGAACAT GATCACTGGT ACCTCCCAGG CTGATTGTGC 150 

TATTCTCATC ATTGCCGCTG GT AC TGGTGA GTTCGAGGCC GGTATCTC CA 2 00 

AGGATGGCCA GACCCGTGAG CATGCTCTGC TCGCCTACAC CCTGGGCGTG 2 50 

2 5 AAGCAGCTTA TCGTCGCCAT CAACAAGATG GACACCACCA AATGGTCTGA 3 00 

GGATCGTTTC AACGAAATCA TCAAGGAGAC TTCCAACTTC ATCAAGAAGG 350 

TCGGATACAA CCCCAAGTCC GTTCCATTCG TGCCCATCTC CGGCTTCAAC 4 00 

GGTGACAACA TGATCGACGT CTCCACCAAT GCCCCCTGGT ACAAGGGCTG 4 50 

GGAAAAGGAG TCCAAGGCTG GCAAGGCCAC CGGCAAGACC CTCCTTGAGG 500 

3 0 CTATCGACTC CATCGACCCT CCTGCTCGTC CCACCGACAA GCCTCTCCGT 550 

CTCCCACTCC AGGATGTCTA CAAGATTTCT GGTATCGGCA CGGTGCCCGT 60 0 

CGGTCGTGTT GAGACTGGTA CCATCAAGGC CGGTATGGTC GTCACCTTTG 650 

CCCCCGCCAA CGTCACCACT GAAGTCAAGT CCGTCGAAAT GCACCACGAA 700 

CAGCTCGCCG AGGGCGTTCC GGGTGACAAC GTCGGCTTCA ACGTCAAGAA 750 

35 CGTCTCCGTG AAGGAGGTTC GTCGTGGAAA CGTTGCTGGT GACTC CAAGA 8 00 

ACGACCCCCC CAAGGGTGCC GACTCCTTCA ACGCCCAGGT CATCGTCCTC 850 

AACCACCCTG GTCAGGTCGG TGCTGGCTAC GCCCCGGTCT TGGATTGCCA 900 

CACTGCCCAC ATTGCCTGCA AGTTCTCTGA GCTCCTCGAG AAGATCGATC 950 

GTCGTACCGG CAAGTCCATG GAAAACAACC CCAAGTT CAT CAAGTCTGGT 1000 

4 0 GATGCTGCCA TCGTGAAGAT GGTTCCCAGC AAGC CTATGT GCGTTGAGGC 1050 

TTTCACCGAC TACCCTCCTC TTGGTCGTTT CGCCGTCCGT GAC 1093 



45 2) INFORMATION FOR SEQ ID NO: 780 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cunninghamella bertholletiae 

(B) STRAIN : ATCC 42115 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 0 



60 
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TACTTGTAAA TGGTCTCAAG ATCGTTACAA CGAAATTGTT AAGGAAGTTT 5 0 

CTTCCTTCAT CAAGAAGATT GGTTACAATC CTAAATCCGT TCCTTTCGTY 100 

CCTATCTCTG GTTGGCACGG TGATAACATG TTGGAAGCTT CTACCAACAT 15 0 

GCCTTGGTAC AAGGGATGGA CCAAGGAAAC TAAAGCTGGT TCTTCCACTG 2 00 

5 GTAAGACTCT CTTAGAAGCC ATTGACAGCA TTGAACCTCC TACCCGTCCT 250 

TCTGACAAGC CTTTACGTCT TCCTTTACAA GATGTTTACA AGATTGGTGG 3 00 

TATTGGTACT GTCCCTGTTG GTCGTGTTGA AACTGGTGTC ATCAAGGCTG 3 50 

GTATGGTTGT TACYTTCGCT CCCGCTAACG TCACCACTGA AGTTAAGTCC 400 

GTTGAAATGC ATCACGAACA ATTAGAACAA GGTGTTCCTG GTGACAACGT 4 50 

10 TGGTTTCAAC GTCAAGAACG TTTCCGTTAA GGATATCCGT CGTGGTAACG 50 0 

TCTGTTCCGA CTCCAAGAAC GACCCCGCTA AGGAATCTGC TTCCTTCAAC 550 

GCTCAAGTTA TCGTCTTGAA CCACCCTGGT CAAATTGGTG CTGGTTATGC 600 

CCCAGTTCTT GACTGTCACA CTGCTCACAT TGCTTGTAAG TTCGCTGAAT 650 

TATTAGAAAA GATCGATCGT CGTTCCGGTA AGAAACTCGA AGATGCTCCT 700 

15 AAATTCGTTA AATCTGGTGA CTCTGCTATC GTTAAGATGG TTCCTTCCAA 75 0 
GC 



20 2) INFORMATION FOR SEQ ID NO: 781 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 728 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

.30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Curvularia lunata 

(B) STRAIN: ATCC 26425 



35 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 781 



2) INFORMATION FOR SEQ ID NO: 782 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1145 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
6 0 (D) TOPOLOGY: Linear 



752 



CAAGTGGTCT GAGGACCGTT AC CAGGAAAT CATCAAGGAG ACCTCCAACT 50 

TCATCAAGAA GGTCGGCTAC AACCCCAAGC ACGTTCCCTT CGTCCCCATC 100 

TCCGGTTTCA ACGGAGACAA CATGATTGAG GCCTCCACCA ACTGCCCCTG 15 0 

GTACAAGGGT TGGGAGAAGG AGACCAAGGC CAAGGCCACT GGTAAGACCC 2 00 

4 0 TCCTTGAGGC CATCGACGCC ATCGACCCTC CTGTCCGTCC TACCGACAAG 2 50 

CCCCTCCGCC TTCCCCTCCA GGATGTTTAC AAGATTGGTG GTATTGGCAC 3 00 

GGTCCCCGTC GGTCGTGTCG AGAC CGGTAT CATCAAGCCC GGTATGGTCG 350 

TCACCTTCGC CCCCGCTGGT GTCACCACCG AAGTCAAGTC CGTCGAGATG 400 

CACCACGAGC AGCT TACTGA GGGTGTCCCC GGTGACAACG TCGGCTTCAA 4 50 

4 5 CGT CAAGAAC GTCTCCGTCA AGGAGATCCG TCGTGGTAAC GTTGCCGGTG 50 0 

ACTCCAAGAA CGACCCCCCC AAGGGTTGCG AGTCCTTCAA CGCCCAGGTC 5 50 

ATCGTCCTCA ACCACCCTGG TCAGGTCGGT GCCGGTTACG CCCCAGTCCT 600 

TGACTGCCAC ACTGCCCACA TTGCCTGCAA GTTCTCCGAG CTCCTCGAGA 650 

AGATCGACCG CCGTACCGGA AAGTCTGTTG AGAACTCCCC CAAGTTCATC 700 

50 AAGTCCGGTG ACGCCGCCAT CGTCAAGA 728 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Fonsecaea pedrosoi 

(B) STRAIN: ATCC 18831 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 782 

10 TGAAGTCCGA GCGTGAGCGT GGTATCACCA TCGATATCGC CCTCTGGAAG 50 

TTCGAGACTC CCAAGTAAGG CTCAACAGAC ACAACAAGCA AATGCATACT 100 

CGCTAACCTA TTCACCCACC ACAGGTACAA CGTCACCGTC ATTGACGCCC 15 0 

CCGGTCACCG TGATTTCATC AAGAACATGA TCACTGGTAC CTCCCAGGCT 200 

GACTGCGCCA TTCTCATCAT TGCCGCCGGT AC TGGTGAGT TCGAGGCCGG 25 0 

15 TATCTCCAAG GACGGTCAGA CCCGTGAGCA CGCTCTTCTC GCCTACACCC 300 

TTGGTGTCAA GCAGCTCATC GTTGCCATCA ACAAGATGGA CACCACCAAG 350 

TGGTCTGAGG CCCGTTACCA GGAGATCATC AAGGAGACCT CCGGTTTCAT 4 00 

CAAGAAGGTC GGCTTCAACC CCAAGCACGT TCCCTTCGTG CCCATCTCCG 450 

GTTTCAACGG TGACAACATG ATCGACGTCT CCACCAACTG CCCCTGGTAC 500 

2 0 AAGGGTTGGG AGAAGGAGAC CAAGGCCAAG GCCACCGGCA AGACCCTCCT 550 

CGAGGCCATT GACGCCATCG ACCCCCCCAC TCGTCCCACC GACAAGCCCC 600 
TCCGTCTTCC CCTYCAGGAT GTCTACAAGA TCGGTGGTAT TGGCACGGTT 650 
CCCGTCGGTC GTGTCGAGAC CGGTACCATC AAGGC CGGCA TGGTCGTCAC 700 
CTTCGCCCCC GCTGGTGTCA CCACTGAGGT CAAGTCCGTC GAGATGCACC 750 
25 ACGAGCAGCT TCCCGAGGGT CTCCCCGGTG ACAACGTCGG CTTCAACGTC 800 
AAGAACGTCT CCGTCAAGGA GATCCGTCGT GGCAACGTCG CCGGTGACTC 8 50 

CAAGAACGAC CCCCCCAAGG GCTGCGACAG CTTCAACGCC CAGGTCATCG 900 
TCCTCAACCA CCCCGGTCAG GTCGGCGCCG GCTACGCGCC CGTCCTCGAC 950 

TGC CACACTG CTCACATTGC TTGCAAGTTC TCTGAGCTCC TCGAGAAGAT 10 00 

3 0 CGACCGCCGT ACCGGCAAGT CCATTGAGGC CAGCCCCAAG TT CAT CAAGT 1050 

CTGGTGACGC CGCCATCGTC AAGATGGTTC CCTCCAAGCC TATGTGCGTT 110 0 

GAGGCCTTCA CCGACTACCC CCCTCTTGGA CGTTTCGCCG TCCGT 114 5 

35 

2) INFORMATION FOR SEQ ID NO: 783 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1151 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microsporum audouinii 

(B) STRAIN: ATCC 11347 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 783 

GCTCAAGGCT GAGCGTGAGC GTGGTATCAC CATTGACATC GCCCTCTGGA 50 

AGTTCGAGAC CCCCAAGTAC ATGGTCACCG TCATCGGTAT GCTTTATCTG 100 

TTTCCCATTT ATAGTTGCGA CAAGTAACTA ATAAAAAGTA GATGCCCCCG 15 0 

55 GACACCGTGA CTTCATCAAG AACATGATTA CTGGTACCTC CCAGGCCGAC 2 00 

TGCGCTATTC TCATCATTGC TGCCGGTACT GGTGAGTTCG AGGCTGGTAT 2 50 

CTCCAAGGAT GGCCAGACTC GTGAGCACGC CCTGCTCGCT TTCACCCTCG 3 00 

GTGTCAAGCA GCTCATCGTT GCCATCAACA AGATGGACAC CACCAACTGG 350 

TCTGAGTCCC GTTTCGGTGA AATCATCAAG GAAGTCACCA ACTT CATCAA 4 00 

60 GAAGGTCGGC TACGACCCCA AGGGTGTCCC ATTCGTCCCA ATCTCTGGCT 4 50 
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TCAACGGTGA CAACATGATT GAGCCCTCCA 
GGATGGAACA AGGAGACCAA GGCCGGTGGC 
CCTTGAGGCC ATCGATGCCA TTGACATGCC 
CTCTCCGTCT CCCACTCCAG GATGTCTACA 
5 GTACCAGTCG GTCGTGTTGA GACTGGTATC 
CACTTTCGCC CCCGCCAACG TCACCACTGA 
ACCACCAGCA GCTCGTTCAG GGTGTTCCCG 
GTCAAGAACG TCTCTGTCAA GGAAGTCCGC 
TTCCAAGAAC GACCCACCAT CTGGCTGCGC 

10 TCGTCCTCAA CCACCCCGGC CAGATCGGTG 
GACTGCCACA CTGCCCACAT TGCTTGCAAG 
GATTGACCGC CGTACTGGTA AATCCGTCGA 
AGTCTGGTGA TGCCGCTATT GCCACCATGG 
GTTGAGGCTT TCACTGACTA CCCACCACTT 

15 C 



CCAACTGCCC ATGGTACAAG 500 

AAATCCTCTG GTAAGACCCT 550 

CACTCGTCCC ACCGACAAGC 600 

AGATCTCTGG T AT CGGAACA 650 

ATCAAGCCTG GTATGGTTGT 7 00 

AGTCAAGTCC GTC GAAATGC 750 

GTGACAACGT TGGCTTCAAC 800 

CGTGGTAACG TTGCCGGTGA 850 
CTCTTTCAAG GCCCAGGTCA 900 
CTGGTTACGC CCCAGTCCTC 950 

TTCTCTGAGC TTCTTGAGAA 1000 

AACCAGCCCT AAGTTCGTCA 1050 

TTCCATCCAA GCCCATGTGC 1100 

GGTCGTTTCG CCGTCCGTGA 1150 

1151 



2) INFORMATION FOR SEQ ID NO: 784 



20 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 97 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Mucor circinelloides 

'(B) STRAIN: ATCC 38592 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 784 

3 5 CCAAGTGGTC TCAAGATCGT TACAACGAAA TTGTCAAGGA AGTTTCCGGT 50 

TTCATCAAGA AGAT CGGTTT CAACCCCAAG TCCGTTCCTT TCGTTCCCAT 100 

TTCTGGCTGG CACGGTGATA ACATGTTGGA TGAATCCACC AACATGC C CT 150 

GGTTCAAGGG ATGGAACAAG GAGACCAAGG CCGGTTCCAA GACTGGTAAG 2 00 

ACTCTCCTCG AAGCCATCGA TGCCATTGAG CCCCCTGTCC GTCCTTCTGA 2 50 

4 0 CAAGCCTCTC CGTCTTCCTC TTCAAGATGT CTACAAGATT GGTGGTATTG 3 00 

GTACAGTTCC CGTCGGTCGT GTTGAAACTG GTACTATCAA GGCTGGTATG 350 

GTTGTCAACT TCGCTCCCGC TGCTGTCACC ACTGAAGTTA AGTCCGTTGA 400 

AATGCATCAC GAAACCCTCT CTGAAGGTCT CCCCGGTGAC AACGTTGGTT 4 50 

TCAACGTCAA GAACGTCTCC GTCAAGGATA TCCGTCGTGG TAACGTCTGT 500 

45 TCCGACTCCA AGAACGATCC CGCTAAGGAA TCTGCCTCTT TCACTGCTCA 55 0 

AGTTATTATC TTGAACCATC- CCGGTCAAAT CTCTGCTGGT TACGCAC CAG 600 

TTCTCGATTG TCACACTGCT CACATCGCCT GTAAGTTCTC TGAACTCATT 650 

GAGAAGATTG ATCGTCGTTC CGGTAAGTAC CTGCATCTGT CAGAATTGAA 70 0 

GGTCCGCCGT TATAGCAAAG GCTGGGTTTA AATGTTGGGG TTTGTCTGAT 750 

50 CTATAATGAT GATTGCTCCT TCAATTTTTG ACATAATTTG ATGATCTGAA 800 

TTGTGTTGCT AACGTCGCAT TTGCTTCTTT TGCTTCTTTT GCATGTAGGT 850 

AAGAAGATGG AAGATGCTCC CAAAGTAAGT ATTACGATTG ATGGACAATT 900 

AAAATAGAAT ACTAACAATT ATTGTTTATA GTTCGTCAAG TCTGGTGACT 950 

CTGCTATCGT CAAGATGGTT CCCTCCAAG 979 

55 



60 



2) INFORMATION FOR SEQ ID NO: 785 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1099 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNES S : Doubl e 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Phialaphora verrucosa 
10 (B) STRAIN: ATCC 38561 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 785 

GGACAAGCTG AAGGCCGAAC GTGAGCGTGG TATCACCATC GATATCGCGC 50 

15 TCTGGAAGTT CGAGACTCCC AAATACTTCG TCACCGTCAT TGATGCCCCT 100 

GGTCATCGTG ACTTCATCAA GAACATGATC ACTGGTACCT CCCAGGCTGA 150 

TTGTGCCATC CTCATCATTG CCGCCGGTAC CGGTGAGTTC GAAGC CGGTA 200 

TCTCCAAGGA TGGCCAGACC CGTGAGCACG CTCTCCTCGC CTACACCCTA 250 

GGTGTGAAGC AGCTTATCGT CGCCATCAAC AAGATGGACA CCGCCAAATG 300 

2 0 GTCCGAGGAT CGGTTCAACG AAATCATCAA GGAGACTTCC AACTTCATCA 350 

AGAAGGTCGG ATACAACCCC AAGTCCGTCC CGTTCGTGCC CATCTCCGGT 4 00 

TTCAACGGTG ACAACATGAT CGACGTCTCC TCCAACGCCC CCTGGTACAA 450 

GGGTTGGGAG AAGGAGACCA AGGCCGGCAA GGCCACTGGC AAGACCCTCC 50 0 

TCGAGGCCAT CGACGCGATT GACCCTCCTA CTCGTCCCAC CGACAAGCCC 550 

25 CTCCGTCTCC CTCTCCAGGA TGTCTACAAG ATCTCTGGTA TCGGCACGGT 600 

GCCCGTTGGT CGTGTTGAGA CCGGTACCAT CAAAGCCGGT ATGGTCGTCA 650 

CCTTCGCTCC CGCCAACGTC ACCACTGAAG TCAAGTCCGT CGAAATGCAC 700 

C ACGAAC AG C TCGCCGAAGG TGTTCCAGGT GACAATGTCG GCTTCAACGT 750 

CAAGAACGTC TCCGTCAAGG AGGTTCGTCG TGGAAACGTT GCCGGTGACT 800 

3 0 CCAAGAATGA CCCCCCCAAG GGTGCCGACT CCTTCAACGC TCAGGTCATC 850 

GTCCTCAACC ACCCTGGTCA GGTCGGTGCC GGCTACGCCC CGGTCTTGGA 900 

TTGCCACACT GCCCACATTG CTTGCAAGTT CTCTGAGCTC CTCGAGAAGA 950 

TCGATCGTCG TAC CGGCAAG TCCATGGAAA ACAACCCCAA GTTTATCAAG 10 0 0 

TCTGGTGATG CTGCCATCGT GAAGATGGTT CCCAGCAAGC CTATGTGCGT 1050 

35 TGAGGCCTTC ACCGACTATC CTCCTCTTGG TCGTTTCGCC GTCCGTGAC 1099 



2) INFORMATION FOR SEQ ID NO: 786 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNES S : Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Saksenaea vaaiformis 

(B) STRAIN: ATCC 60625 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 786 

55 ACCACCAAGT GGTCTGAGGC CCGTTACCAG GAGATCATCA AGGAGACCTC 50 

CGGYTTCATC AAGAAGGTCG GCTTCAACCC CAAGCACGTT CCCTTCGTGC 100 

CCATCTCCGG TTTCAACGGT GACAACATGA TCGACGTCTC CACCAACTGC 150 

CCCTGGTACA AGGGTTGGGA GAAGGAGACC AAGGCCAAGG CCACCGGCAA 2 00 

GACCMTCCTC GAGGCCATTG ACGCCATCGA CCCCCCYAGY CGTCCCACCG 2 50 

60 ACAAGCCCCT YCGTCTTCCC CTMCAGGATG TYTACAAGAT TGGCGGTATT 3 00 
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GGCACGGTTC CCGTCGGTCG TGTYGAGACC 
GGTCGTCACC TTCCCCCCCG CTGGTGTCAC 
AGATGCACCA CGAGCAGCTC GCCGAGGGTS 
TTCAACGTCA AGAACGTCTC CGTCAAGGAG 
5 CGGTGACTCC AAGAACGACC CCCCCAAGGG 
AGGTCATCGT CCTCAACCAC CCCGGTCAGG 
GTCCTSGACT GCCACACTGC TCACATTGCC 
CGAGAAGATC GACCGCCGTT CCGGCAAGTC 
TCATCAAGTC TGGTGACGCC GCCATCGTCA 



GGTRCCATCA AGGGTGGCAT 350 

CACTGAGGTC AAGTCCGTCG 4 00 

TCCCCGGTGA CAACGTCGGC 4 50 

ATCCGTCGTG GCAACGTTGC 50 0 

CTGCGACAGC TTCAACGCCC 550 

TCGGCGCCGG CTACGCGCCS 60 0 

TGCAAGTTCT CTGAGCTCCT 650 

CATCGAGTCC GGCCCCAAGT 7 00 

AGATGGTTCC CTCCAAGCCC 750 



2) INFORMATION FOR SEQ ID NO: 787 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1084 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



20 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Syncephalastrum racemosum 

2 5 (B) STRAIN: ATCC 3233 0 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 878 

GCTCAAGGCC GAGCGTGAGC GTGGTATCAC CATCGATATC GCTCTCTGGA 50 

3 0 aS??cgagac CCCCAAGTAC CACGTCACCG TCATTGATGC CCCCGGCCAT 100 

CGTGATTTCA TCAAGAACAT GATCACTGGT ACCTCGCAGG CTGACTGCGG 150 

??tcctStc ATTGCCGCCG GTACTGGTGA GTTCGAGGCT GGTATCTCCA 2 00 

IrGATGGCCA GACCCGTGAG CACGCTCTGC TTGCCTTCAC CCTCGGTGTC 250 

CG?CAotS ?CGTCGCCAT CAACAAGATG GACTCGACCA AGTACTCTGA 300 

3 5 GGCCCGTTAC AACGAAATCG TCAAGGAGGT CTCCACCTTC ATCAAGAAGA 3 50 

?PGCT?TCAA CCCCAAGTCC GTTCCCTTCG TCCCCATCTC TGGCTGGAAC 4 00 

S?§A?SS TGTTGGAGGA GTCCTCCAAC ATGCCCTGGT TCAAGGGCTG 450 

SIgAAGGAG ACCAAGGCTG GCGAGAAGTC CGGCAAGACC CTCCTTGAGG 500 

S^TTGACAA CATTGACCCC CCGGTCCGTC CCTCGGACAA GCCCCTCCGT 550 

4 0 CTTCCCCTCC AGGATGTCTA CAAGATCGGT GGTATCGGCA CAGTCCCCGT 6 00 

SgTCGTGTC GAGACTGGTG TCATCAAGGC TGGTATGGTC GTGACCTTCG 650 

ScCCGCCAA CGTCACCACT GAAGTCAAGT CCGTCGAGAT GCACCACGAG 700 

CAGCTCGTCG AGGGTGTCCC CGGTGACAAC GTCGGTTTCA ACGTCAAGAA 75 0 

StTTCCGTC AAGGATATCC GCCGTGGTAA CGTCTGCTCT GACTCCAAGA 800 

45 ACGACCCCGC CAAGGAGTCT GCCTCGTTCA CCGCCCAGGT CATCGTCCTG 850 

aa?caccccg GTCAGATCGG TGCCGGTTAC GCCCCGGTTC TTGACTGCCA 900 

CACCGCTCAC ATTGCCTGCA AGTTCGCTGA GCTCCTCGAG AAGATCGACC 950 

GTCGTTCCGG YAAGAAGCTC GAAGAGTCCC CCAAGTTCGT CAAGT CGGGT 1000 

GACTCCGCCA TCGTCAAGAT GGTTCCCTCC AAGCCCATGT GCGTTGAGGC 10 50 

50 CTACACTGAG TACCCCCCTC TTGGCCGTTT CGCC 10 84 



2) INFORMATION FOR SEQ ID NO: 788 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1155 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Trichophyton tonsurans 

(B) STRAIN: ATCC 56185 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 788 

10 GCTCAAGGCC GAGCGTGAGC GTGGTATCAC CATCGATATC GCCCTCTGGA 50 
AGTTCGAGAC CCCCAAGTAC AATGTCACCG TCATTGGTAT GTTTTTCTTT 100 
ACCTTTCCCC TCCATCGTCT TGCTGTGCCA TAACTAACGA GAGTAGACGC 150 
CCCCGGTCAC CGTGACTTCA TCAAGAACAT GATCACTGGT ACCTCCCAGG 200 
CTGACTGTGC TATTCTCATC ATTGCTGCCG GTACTGGTGA GTTCGAGGCT 25 0 

15 GGTATCTCCA AGGATGGC CA GACCCGTGAG CACGCTCTGC TCGCCTTCAC 300 
CCTTGGTGTC AAGCAGCTCA TCGTTGCCAT CAACAAGATG GACACCACCA 350 
ACTGGTCCGA GGACCGTTTC AAGGAAATCA TCAAGGAAGT CACCAACTTC 40 0 

ATCAAGAAGG TTGGCTACGA CCCCAAGGGT GTTCCATTCG TTCCAATCTC 4 50 

TGGTTTCAAC GGTGACAACA TGATTGAGGC CTCCAGCAAC TGCCCATGGT 500 

20 ACAAGGGATG GAACAAGGAG ACCAAGGCCG GTGGTGCCAA GACTGGCAAG 55 0 

ACCCTYCTCG AGGCCATCGA TGCCATCGAC ATGCCAACCC GTCCTACCGA 60 0 

CAAGCCCCTY CGTCTCCCAC TCCAGGATGT CTACAAGATC TCTGGTATCG 650 
GAACTGTACC AGTCGGTCGT GTTGAGACCG GTATCATCAA GCCTGGTATG 700 
GTCGTCACCT TCGCCCCTGC CAACGTCACC ACTGAAGTCA AGTCCGTCGA 75 0 

2 5 AATGCACCAC CAGCAGCTTC AGCAGGGTGT CCCCGGTGAC AACGTCGGCT 8 00 

TCAACGTCAA GAACGTTTCC GTCAAGGAAG TCCGCCGTGG TAACGTTGCC 85 0 

GGTGACTCCA AGAACGACCC ACCATCCGGC TGTGCCTCCT TCAACGCCCA 90 0 

GGTCATCGTC CTCAACCACC CCGGCCAGAT CGGTGCTGGT TACGCCCCAG 950 

TCCTCGACTG CCACACTGCT CACATTGCTT GCAAGTTCGC TGAGCTCCTC 100 0 

3 0 GAGAAGATTG ACCGCCGTAC CGGTAAATCC GTCGAAGCCA AC CCCAAGTT 10 50 

. CGTCAAGTCT GGTGATG CCG CTATCGCCAA GATGGTTCCC TCCAAGCCCA 110 0 

TGTGCGTTGA GGCTTTCACT GACTACCCCC CACTTGGTCG TTTCGCCGTC 1150 

CGTGA 1155 



35 



50 



2) INFORMATION FOR SEQ ID NO: 789 



(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 113 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

4 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: ATCC 8125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 9 



TCAAGGCCGA GCGTGAGCGT GGTATCACCA TCGATATCGC CCTCTGGAAG 50 

TTCGAGACCC CCAAGTACAA TGTCACCGTC ATTGGTATGT TTCTCTTTAC 100 

55 CTTTCCCCTC CATCGTCTTG CTGTGCCATA ACTAACGAGA GTAGACGCCC 150 

CCGGTCACCG TGACTTCATC AAGAACATGA TCACTGGTAC CTCCCAGGCT 2 00 

GACTGTGCTA TTCTCATCAT TGCTGCCGGT ACTGGTGAGT TCGAGGCTGG 2 50 

TATCTCCAAG GATGGCCAGA CCCGTGAGCA CGCTCTGCTC GCCTTCACCC 3 00 

TTGGTGTCAA GCAGCTCATC GTTGCCATCA ACAAGATGGA CACCACCAAC 3 50 

60 TGGTCCGAGG ACCGTTTCAA GGAAATCATC AAGGAAG TCA CCAACTTCAT 4 00 
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CAAGAAGGTT GGCTACGACC CCAAGGGTGT 
GTTTCAACGG TGACAACATG ATTGAGGCCT 
AAGGGATGGA ACAAGGAGAC CAAGGCCGGT 
CCTCCTCGAG GCCATCGATG CCATCGACAT 
5 AGCCCCTCCG TCTCCCACTC CAGGATGTCT 
ACTGTACCAG TCGGTCGTGT TGAGACCGGT 
CGTCACCTTC GCCCCTGCCA ACGTCACCAC 
TGCACCACCA GCAGCTTCAG CAGGGTGTCC 
AAC GTCAAGA ACGTTTCCGT CAAGGAAGTC 

10 TGACTCCAAG AACGACCCAC CATCCGGCTG 
TCATCGTCCT CAACCACCCC GGCCAGATCG 
CTCGACTGCC ACACTGCTCA CATTGCTTGC 
GAAGATTGAC CGCCGTACCG GTAAATCCGT 
TCAAGTCTGG TGATGCCGCT ATCGCCAAGA 

15 TGCGTTGAGG CTTTCACTGA CTACCCCCCA 



TCCATTCGTT CCAATCTCTG 450 

CCACCAACTG CCCATGGTAC 50 0 

GGTGCCAAGA CTGGCAAGAC 550 

GCCAACCCGT CCTACCGACA 600 

ACAAGATCTC TGGTATCGGA 650 

ATCATCAAGC CCGGTATGGT 7 00 

TGAAGTCAAG TCCGTCGAAA 750 

CCGGTGACAA CGTCGGCTTC 8 00 

CGCCGTGGTA ACGTTGCCGG 850 

TGCCTCCTTC AACGCCCAGG 90 0 
GTGCTGGTTA CGCCCCAGTC 950 

AAGTTCGCTG AGCTCCTCGA 1000 

CGAAGCCAAC CCCAAGTTCG 1050 

TGGTTCCATC C AAGC CTATG 1100 

CTTGGTCG 1138 



2) INFORMATION FOR SEQ ID NO: 790 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

2 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

3 0 (A) ORGANISM: Bipolaris hawaiiensis 

(B) STRAIN: ATCC 26067 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 790 

, 5 CACCACCAAG TGGTCTGAGG AGCGTTACCA GGAAATCATC AAGGAGACCT 50 

CCAACTTCAT CAAGAAGGTC GGCTACAACC CCAAGCACGT TCCCTTCGTC 100 

CCCATCTCTG GTTTCAACGG AGACAACATG ATTGAGGCTT CCACCAACTG 150 

CCCCTGGTAC AAGGGTTGGG AGAAGGAGAC CAAGGCCAAG GCCACTGGTA 200 

AGACCCTTCT CGAGGCCATC GACGCCATCG ACCCCCCTGT CCGTCCTACC 2 50 

4 0 GACAAGCCCC TCCGCCTTCC CCTCCAGGAT GTGTACAAGA TTGGTGGTAT 3 00 
~ TGGCACGGTT CCCGTCGGTC GTGTCGAGAC CGGTATCATC AAGCCCGGTA 3 50 

TGGTCGTCAC CTTCGCCCCC GCTGGTGTCA CCACTGAAGT CAAGTCCGTC 4 00 

GAGATG C AC C ACGAGCAGCT TGCCGAGGGT GTCCCCGGTG ACAACGTCGG 4 50 

CTTCAACGTC AAGAACGTCT CCGTCAAGGA GATCCGTCGT GGTAACGTTG 500 

4 5 CCGGTGACTC CAAGAACGAC CCCCCCAAGG GTTGCGAGTC CTTCAACGCC 550 

CAGGTCATCG TCCTCAACCA CCCCGGTCAG GTCGGTGCCG GTTACGCACC 600 

AGTCCTTGAC TGCCACACTG CCCACATTGC TTGCAAGTTC TCCGAGCTCC 650 

TCGAGAAGAT TGACCGCCGT ACCGGAAAGT CTGTTGAGAA CTCCCCCAAG 700 

TT CAT CAAGT CCGGTGACGC CGCCATCGTC AAGATGGTTC CCTCCAAG 74 8 

50 



2) INFORMATION FOR SEQ ID NO: 791 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 958 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 
5 (B) STRAIN: ATCC 14110 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 791 

CGCTATTGTC GTTGTTGCTG CCTCCGACGG TCAGATGTAG GTGGAACATC 5 0 

10 TTGGGAAATA CGTCGTAAAA CACGGCGCTT ACGTTTTCGC GAATAGGCCC 100 

CAGACTCGTG AGCATTTGCT GCTCGCCCGC CAGGTTGGTG TCCAGAAGAT 150 

CGTTGTCTTC GTCAACAAAA TCGATGCTAT TGATGATCCG GAGATGCTGG 2 00 

AACTGGTCGA ACTCGAGATG CGTGAGCTGC TGAACAGCTA CGGTTTCGAG 25 0 

GGTGAAGAGA CTCCGATCAT TTTCGGTTCC GCTCTCTGTG CTCTCGAAGG 3 00 

15 ACGCCGTGAC GACAT CGGTA AAGACAGAAT TGAGCAGCTT ATGAACGCTG 35 0 

TCGACACCTG GATCCCCACT CCTCAGCGTG ACCTCGACAA ACCTTTCTTG 400 

ATGTCTGTCG AGGAAGTGTT CTCTATCGCC GGCCGTGGTA CCGTGGCTTC 45 0 

TGGTCGTGTC GAGCGTGGTA TCTTGAAGAA GGACTCTGAG GTTGAGATTG 500 

TTGGAGGCTC CTTCGAACCC AAGAAGACCA AAGTCACCGA CATTGAAACC 550 

2 0 TTCAAGAAGA GCTGTGATGA ATCGCGTGCT GGTGACAACT CTGGTCTCCT 6 00 

CCTGCGTGGT ATCCGACGTG AAGACGTCAA GCGTGGTATG GTCATTGCTG 650 

TTCCCGGCAG CACCAAAGCT CACGACAAGT TCCTCGTCTC CATGTACGTC 700 

CTGACCGAGG CGGAGGGTGG TCGTCGTACT GGCTTCGGTG CCAACTACCG 75 0 

TCCCCAAGTC TTCATCCGTA CTGCAGGTAA GTTCCCGCAC AC CGTGTCCA 8 00 

2 5 GATCTTCCGA GAGATTAGCG ATATATGCTA ATGATTCATC AGACGAGGCT 850 

GCTGACCTCA GCTTCCCTGA CGGCGAC CAA TCTCGCAGAG TTATGCCTGG 90 0 

TGACAACGTC GAGATGATCC TGAAGACCCA CCACCCTGTT GCTGCTGAGG 950 

CTGGTCAA 958 



30 



45 



2) INFORMATION FOR SEQ ID NO: 792 



(i) SEQUENCE CHARACTERISTICS: 

3 5 (A) LENGTH: 93 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

4 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: ATCC 8125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 792 



CGTTGTCGCA GCTTCTGACG GTCAAATGTA ATTGAATGCC CGCCCAGACG 50 

GATGAAAGGA TTTGACGTTT CTAACATCAT TCTAGGCCTC AGACCAGAGA 100 

50 ACATTTGCTC CTTGCCCGCC AGGTCGGTGT CCAGAAGCTG GTCGTTTTCG 150 

TTAACAAGGT CGATGCCGTT GAGGACCCAG AGATGTTGGA GCTTGTCGAA 200 

CTTGAAATGC GTGAACTCCT CAGCCACTAC GGTTTCGAGG GTGAGGAGAC 2 50 

CCCCATCATT TTTGGCTCTG CTCTCTGTGC CCTCGAGTCC CGTCGACCTG 3 00 

AGCTTGGTGT CGAGAAGATT GACGAGCTAT TGAACGCCGT CGACACCTGG 3 50 

55 ATCCCCACCC CCGAGCGCGC CACTGATAAG CCTTTCCTCA TGTCCATTGA 4 00 

GGAAGTGTTC TCTATCTCTG GTCGTGGTAC CGTCGTCTCC GGTCGTGTTG 4 50 

AGCGTGGTAT CCTCAAGAAG GATTCCGACG TCGAAATTGT TGGTGGCTCT 500 

ACCACCCCTA TCAAGACCAA GGTCACAGAT ATCGAAACCT TCAAGAAGTC 550 

CTGCGATGAA TCTCGAGCTG GTGACAACTC TGGTCTCCTT CTCCGAGGTA 6 00 

60 TCAAGCGTGA GGACTTGAAG CGTGGAATGG TTGTTGCTGC CCCCGGATCC 650 
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ACCAAGGCTC ACACCGACTT 
TGAGGGTGGT CGTTCCAACG 
TCATCCGTAC TGCTGGTATG 
ATCATTGCTA ACTTGTATTT 
5 TGGCCTGGAG AAGACCAAGA 
GATGATTTGC AAGACCCTCC 



CATGGTCTCC CTCTACGTCC 
GCTTCACCCA CAAGTACCGC 
TAACCCAAGT TTCCGCTATT 
CCTTCCGTAG ACGAAGCCGC 
CAAGAAGGCT ATGCCTGGTG 
ACCCCATTGC TGCCGA 



TGACTGAGGC 7 00 

CCCCAAATGT 75 0 

TACTAAGTAG 800 
ATCTTTCAGC 850 
ACAACGTCGA 90 0 

936 



10 2) INFORMATION FOR SEQ ID NO: 793 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
2 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 793 

TTATTGTTGC TGCTGGTACT 



2) INFORMATION FOR SEQ ID NO: 794 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

MOLECULE TYPE: DNA 
ORIGINAL SOURCE: 

(A) ORGANISM: Tirypanosoma cruzi 
SEQUENCE DESCRIPTION: SEQ ID NO: 794 
GACGACAAGT CGGTGAACTT 
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(ii) 

35 

(vi) 



(xi) 



45 2) INFORMATION FOR SEQ ID NO: 795 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 795 
ACTTGCACGC GATGTGGCAG 2 0 
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2) INFORMATION FOR SEQ ID NO: 796 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 796 
GGTCCAATGC CWCAAACWAG A 21 



15 



20 2) INFORMATION FOR SEQ ID NO: 797 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 97 

CATTAAGAAT GGYTTATCTG" TSKCTCT 27 



35 



45 



50 



2) INFORMATION FOR SEQ ID NO: 798 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 798 
TGGTTGTCCC AGCCGATCGT TT 22 



55 2) INFORMATION FOR SEQ ID NO: 799 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 799 
ACCTGTGAAT ACAAGCAATC T 

2) INFORMATION FOR SEQ ID NO: 800 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 800 
GATGAAATCT TCAACGAAGT TGAT 



2) INFORMATION FOR SEQ ID NO: 801 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 801 
ACAACACCGA GAAGATCCCA 



2) INFORMATION FOR SEQ ID NO: 802 ' 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 802 

5 

TTGC CATTTC TGGTTTCGTT 2 0 

10 2) INFORMATION FOR SEQ ID NO: 803 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 03 
2 5 ACTTCAGTGG TAACACCAGC 20 
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45 



55 



60 



2) INFORMATION FOR SEQ ID NO: 804 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

3 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

4 0 (A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 804 
CCTGGGACGG CCTCTGGCAT 2 0 

2) INFORMATION FOR SEQ ID NO: 805 

5 0 (i) SEQUENCE CHARACTERISTICS : 

(A) . LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 805 
CTCTTGTCCA TCTTAGCAGT 

2) INFORMATION FOR SEQ ID NO: 806 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 06 

AGCATCACCA GACTTGATAA G 



2) INFORMATION FOR SEQ ID NO: 807 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 807 
AAAGTGGCTT CAAAGGTTGC 



2) INFORMATION FOR SEQ ID NO: 808 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 08 
GCITTAIWRG CATTAGAARA YCCA 
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10 



15 



2) INFORMATION FOR SEQ ID NO: 809 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 809 
TCTTCCTGTW GCAACTGTTC CTCT 2 4 

2) INFORMATION FOR SEQ ID NO: 810 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 810 

AGAGMWACAG ATAARSCATT CTTA 24 
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40 



45 



2) INFORMATION FOR SEQ ID NO: 811 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 811 
TRAARTAGAA TTGTGGTCTR TATCC 25 

2) INFORMATION FOR SEQ ID NO: 812 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 812 
GTIACIGGIT CYTYRARRTT ICCICC 2 6 

60 
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10 



20 



30 



50 



2) INFORMATION FOR SEQ ID NO: 813 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 813 
15 AATCYGTYGA AATGCAYCAC GA 22 



2) INFORMATION FOR SEQ ID NO: 814 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 814 
GC I GGCACGT ACACIGCCTG 2 0 



35 2) INFORMATION FOR SEQ ID NO: 815 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 
4 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 815 

TGGTGCATYT CKACRGACTT 2 0 



2) INFORMATION FOR SEQ ID NO: 816 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 



60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 816 
GCTACGACGA GATCAAGGGC 2 0 

5 

2) INFORMATION FOR SEQ ID NO: 817 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 817 
TGGAAGAAGG CCGAGGAGTT 2 0 

20 

2) INFORMATION FOR SEQ ID NO: 818 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 818 
35 AGCCGGGCTG GATCTTCTTC 20 



40 



2) INFORMATION FOR SEQ ID NO: 819 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 819 

50 

TCGAGCTTCT GGAGGAAGAG 2 0 



55 2 ) INFORMATION FOR SEQ ID NO: 82 0 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 820 
10 GAAGGAGGTG TCTGCTTACA C 21 



15 



2) INFORMATION FOR SEQ ID NO: 821 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
2 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 
2 5 (A) ORGANISM: Trypanosoma brucei 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 821 

GGCGCAAACG TCACCACATC A 21 

30 



2) INFORMATION FOR SEQ ID NO: 822 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 822 
CGGCGGATGT CCTTAACAGA A 



2) INFORMATION FOR SEQ ID NO: 823 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single . 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 823 
GAGCGGTATG AYGAGATTGT 2 0 

5 

2) INFORMATION FOR SEQ ID NO: 824 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 824 
2 0 GGCTTCTGCG GCACCATGCG 2 0 



25 



55 



2) INFORMATION FOR SEQ ID NO: 825 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 825 

35 

ATGAGCARCG SAACCATCGT TCAGTG 26 



40 2) INFORMATION FOR SEQ ID NO: 826 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82 6 

TCGATCGTGC CGACCATGTA GAACGC 26 



2) INFORMATION FOR SEQ ID NO: 827 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 44 6 bases 
60 (B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium novyi 

(B) STRAIN: ATCC 194 02 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 827 

CACCAACTTG CTAAATGGGG AGATGCC CAG ATTGTTGTAT ATATAGGCTG 50 

TGGAGAACGT GGAAATGAAA TGACAGATGT TCTTAATGAG TTTCCAGAAC 100 

TTAAAGATCC TAAGACTGGC AAATCAATAA TGGAAAGAAC AGTTTTAATA 15 0 

15 GCAAATACTT CTAATATGCC AGTTGCAGCC CGTGAAGCTT GTATATATAC 200 

AGGAATCACA ATAGCAGAAT ATTTTAGAGA TATGGGATAT TCAGTAGCAC 25 0 

TTATGG CGGA TTCCACTTCA CGTTGGGCAG AGGCATTAAG AGAAATGTCT 3 00 

GGAAGACTTG AAGAAATGCC TGGTGATGAA GGTTACCCAG CTTATTTAGG 350 

AT C AAGACTT GCTGATTTCT ATGAAAGAGC TGGAAAAGTT GTGTGTTTAG 4 00 

2 0 GAGACGATGA AAGAGAAGGT GCCATTACTG CAATAGGTGC TGTATC 44 6 



2) INFORMATION FOR SEQ ID NO: 828 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Clostridium difficile 

(B) STRAIN: 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 828 

4 0 CAGCATCAGC TTGCTAAATG GGCAGATGCA GATATAGTTG TATATATAGG 50 

CTGTGGCGAG CGTGGAAATG AAATGACAGA TGTTCTTCTT GAATTTCCTG 100 

AATTAAAAGA CCCAAGAACA GGCGAGTCAC TTATGCAAAG AACTGTGCTT 150 

ATAGCAAATA CATCAGATAT GCCGGTTGCT GCACGTGAAG CTTCTATATA 200 

CACTGGTATT ACAATAGCTG AATATTTTAG AGATATGGGA TATAGTGTTG 250 

45 CACTTATGGC AGACTCTACA TCAAGATGGG CTGAGGCTCT TAGAGAGATG 3 00 

AGTGGTCGTT TAGAGGAGAT GCCTGGTGAA GAAGGTTATC CTGCATACTT 3 50 

AGGTTCACGT CTTGCTCAAT TCTATGAGAG AGCAGGAAAG GTAAATTGTC 4 00 

TAGGTATGGA TGAAAGAGAA GGAACACTTA CAGCAATTGG TGCAG 44 5 



50 



2) INFORMATION FOR SEQ ID NO: 82 9 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 445 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

6 0 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Clostridium septicum 

(B) STRAIN: ATCC 12464 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82 9 

ATG CTATAGC TAAATGGGGA GACAGCGAAA TAGTTGTTTA CGTTGGATGT 50 

GGAGAACGTG GTAACGAAAT GACAGACGTT CTTAACGAAT TCCCAGAACT 100 

10 TATTGACCCA AAAACTGGGG AAAGTTTAAT GAAGAGAACA GTACTTATAG 15 0 

CTAATACTTC AAACATGCCA GTTGCTGCTA GAGAAG CTTG CATATACACA 200 

GGTATTACAA TAGCTGAATA CTTCAGAGAT ATGGGATACT CAGTATCTAT 250 

AATGGC TGAT TCAACTTCAA GATGGGCAGA AGCATTAAGA GAAATGTCAG 3 00 

GTAGAC TTGA AGAAATGCCA GGTGATGAAG GATATCCAGC GTACTTAGGA 350 

15 TCAAGACTTG CTGATTATTA CGAAAGAGCA GGTAAGGTTG TTTGTCTAGG 4 00 

TAAAGATGGT AGAGAAGGTG CTGTAACAGC AATTGGAGCT GTATC 44 5 



20 2) INFORMATION FOR SEQ ID NO: 830 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium botullnum 

(B) STRAIN: 20:3.1 



35 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 0 



TCAAATTGCT AAATGGGGAG ATGCAGAAAT CGTTGTTTAC GTTGGATGCG 50 

GAGAACGTGG TAACGAAATG ACAGACGTTG TTAATGAGTT CCCAGAACTT 100 

ATTGACCCTA AGACTGGCGA AAGCTTAATG AAGAGAACAG TTCTTATAGC 15 0 

TAATACTTCA AACATGCCAG TTGCAGCGAG AGAAGCTTCA ATATATACAG 200 

4 0 GTATCACAAT AGCTGAATAT TTCAGAGATA TGGGATATGC AGTATCAATA 25 0 

ATGGCTGACT CAACTTCAAG ATGGGCTGAG GCATTAAGAG AAATGTCTGG 3 00 

TAGAC TTGAA GAAATGCCTG GTGATGAAGG ATATC CAGCT TACCTTGGAT 350 

CAAGACTTGC TGATTACTAT GAAAGAGCTG GTAAGGTTGA ATGTTTAGGT 400 

AATGATGGAA GAATTGGTTC TATAACAGCA ATCGGTGCGG TATC 4 44 



2) INFORMATION FOR SEQ ID NO: 831 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 45 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium perfringens 

(B) STRAIN: ATCC 13124 

60 
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(xi) SEQUENCE DESCRIPTION: SEQ 

TAGTTCAGCA CCAAGTTGCT AAATGGGGAG 
GTTGGATGTG GAGAACGTGG TAACGAGATG 
5 CC CAGAAC TT AAAGAC C CT A AAACTGGGGA 
TTCTTATTGC TAATACATCT AACATGCCAG 
AT ATAT AC TG GTATAACAAT AGCAGAGTAT 
AGTATCAATC ATGGCTGACT CAACTTCACG 
AAATGTCAGG AAGACTTGAA GAAATGCCAG 
10 TACTTAGGAT CAAGACTTGC TGATTACTAT 
AGCTTTAGGT AAAGATGGAA GAGAAGGAGC 
TATCCC 



ID NO: 831 

ATACTGAGAT AGTTGTTTAC 50 

ACAGACGTTC TTAAC GAATT 100 

AAGCTTAATG AAGAGAACAG 150 

TTGCTGCCAG AGAAGCAT CA 2 00 

TT CAG AG ATA TGGGATACTC 2 50 

TTGGGCAGAG GCTTTAAGAG 3 00 

GAGACGAAGG TTACCCAGCA 350 

GAAAGAG CTG GTAAAGTTGT 40 0 

TGTTACAGCT ATCGGAGCAG 4 50 

456 



15 2) INFORMATION FOR SEQ ID NO: 832 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium tetanx 

(B) STRAIN: ATCC 19406 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 832 

30 CCAACTTGCA AAATGGGCTG ATGCTCAAAT AGTTGTGTAC ATAGGATGTG 50 

GAGAACGTGG AAATGAAATG ACAGACGTTT TAAATGAGTT CCCAGAATTA 100 

AAGGATCCTA AAACCGGGGA ATCTTTAATG AAAAGAAC TG TGTTAATAGC 150 

AAATACATCT AATATGCCTG TTGCAGC TAG AGAAGCATCT ATATATACTG 2 00 

GTATAACAAT AGGGGAATAT TTTAGAGATA TGGGATATTC AATAGC AC TA 250 

ATGGCAGATT CGACTTCTAG ATGGGCAGAG GCTCTAAGAG AAATGTCTGG 3 00 

AAGACTAGAG GAG ATGC CAG GTGAAGAAGG TTATCCAGCT TATTTAGGAT 3 50 

CTAGATTAGC AGAGTTCTAT GAAAGAGCAG GTAATGTTAT ATGTTTAGGT 400 

CAGGATGGAA GAGAAGGAGC ATTAACAGCT ATAGGAGCAG TTTC 444 

40 



2) INFORMATION FOR SEQ ID NO: 833 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1786 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 

55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 3 

TGAAC CAAGG AAAAATAATA AC CGTTT CGG GACCTCTTGT TGTGGCTTCT 
GGGATGCAAG AAGCTAATAT TCAAGATATT TGTCGTGTGG GACATCTTGG 
6 0 CTTAGTCGGA GAAATTATTG AAATGCGTCG CGATCAAGCG TCTATTCAGG 
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TTTATGAGGA AACATCAGGG ATCGGTCCAG GAGAACCAGT AGTGACTACT 200 

GGTTGTCCTT TGTCGGTCGA GTTAGGCCCG GGCCTGATTT CAGAAATGTT 2 50 

TGACGGTATT CAGCGACCGC TTGATCGTTT TCAAAAAGCA ACGGACAGCG 3 00 

ACTTTTTAAT CCGTGGTGTG GCTATCCCAA GTCTTGATCG AAAGGCTAAG 350 

5 TGGGCATTTA TTCCCAAGCT AAGTGTTGGT CAAGAAGTAG TTGCAGGTGA 4 00 

TATTTTAGGA ACTGTGCAAG AAACAGCTGT CATTGAGCAC CGTATCATGG 450 

TTCCTTATAA AGTTTCAGGG ACCTTGGTGG CTATTCATGC AGGGGAC TTC 500 

ACAGTAACAG ATACAGTTTA TGAAATTAAG CAGGAAGACG GTTCCATTTA 550 

CCAAGGTAGC CTCATGCAGA CTTGGCCAGT TCGTCAAAGT CGCCCTGTTG 6 00 

10 CTCAAAAGCT TATCCCAGTC GAACCTTTGG TTACAGGTCA ACGGGTTATT 650 

GACACCTTTT TCCCTGTTAC AAAAGGTGGT GCCGCTGCCG TTCCTGGACC 7 00 

ATTTGGGGCA GGAAAAACAG TTGTGCAGCA TCAAATAGCT AAATTTGC C A 750 

ACGTTGATAT TGTTATTTAT GTCGGTTGTG GGGAACGCGG CAACGAGATG 800 

ACCGACGTTT TGAATGAGTT TCCAGAGTTA ATTGACCCAA ATACAGGCCA 8 50 

15 GTCCATTATG GAGCGCACGG TGTTAATTGC AAACACCTCT AATATGC CAG 900 

TAGCAGCGCG TGAAGCGTCG ATTTACACAG GTATTACCAT TGCCGAATAT 950 

TTCCGTGATA TGGGCTATTC TGTGGCTATC ATGGGAGAC T CGACATCACG 1000 

TTGGGCAGAA GCTCTGCGCG AGATGT CAGG ACGCCTACAA GAAATGCCTG 1050 

GTGATGAAGG CTACCCGGCT TACTTAGGGA GTCGTATTGC CGAATATTAT 1100 

2 0 GAACGGGCTG GTCGTGTTCG GACCTTGGGA AGTCAAGAAC GTGAGGGAAC 1150 

CATTACAGCC ATCGGCGCGG TTTCTCCTCC TGGAGGGGAT ATTTCAGAGC 12 00 

CTGTCACTCA AAACACCCTT CGGATTGTCA AAGTTTTCTG GGGGCTCGAC 1250 

GCGCCTCTTG CGCAACGGCG TCACTTCCCA GCGATTAACT GGCTGACGTC 1300 

TTATTCATTG TAT CAAGATG ATGTAGGAAG CTATATTGAC CGTAAACAGC 1350 

2 5 AATC TAATTG GTCCAACAAG GTAACTCGTG CCATGGCTAT TTTGCAGCGT 14 0 0 

GAAGCCAGTC TAGAAGAAAT TGTACGCTTG GTGGGGCTTG ATTCACTGTC 1450 

TGAACAAGAT CGTTTGACCA TGGCTGTTGC CCGGCAAATT CGGGAGGATT 1500 

ATCTCCAGCA AAATGCCTTT GATTCGGTGG ATACCTTTAC TTCCTTTCCG 1550 

AAACAAGAGG CCATGCTAAC CAATATTTTG ACCTTTAATG AGGAAG C CAG 160 0 

3 0 CAAAGCCCTT TCTTTGGGAG CTTATTTTAA TGAGATTATG GAAGGCACTG 1650 

CTCAGGTACG CGATCGCATC GCACGCAGCA AATTTATCCC AGAAGAAAAC 17 00 

TTAGAGCAGA TTAAAGGGCT TACT CAGAAG GTTACCAAAG AGATTC AC CA 1750 

CGTTTTAGCA AAGGGAGGAA TTTAGATGAG CGTTCT 1786 



35 



50 



2) INFORMATION FOR SEQ ID NO: 834 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 499 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

4 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia, bovis 

(B) STRAIN: Suarez-3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 834 



TATCTCACGT AAGTTTTTGC GCGGCGGTTA TATATCACTC CAGGCC CTGG 5 0 

CTAAGTACGC TAATAC TGAC GTTACTGTCT ATGTGGGATG TGGAGAGCGT 10 0 

55 GGAAACGAGA TTGCGGAGGT GCTTAAGGAG TTCCCTGAGC TGAAGACCAA 15 0 

GGTTGATGGC AAGGAAGTGA GCATTATGAA ACGCACTTGC TTGGTGGC CA 200 

ATACTTCAAA CATGCCAGTG GCCGCCAGGG AGGCTAGTAT CTACACTGGC 25 0 

ATTACCCTAT GTGAAT AC TT CAGGGATATG GGATACAACG CCTGTGTGAT 3 00 

GGCGGATTCC ACCAGTCGTT GGGC TGAGGC TTTGCGTGAG ATATCAGGTC 3 50 

60 GTTTAGC TGA GATGCCTGCT GATTCAGGTT ATCCCGCCTA CCTTGCTTCT 4 00 
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AGGCTTTCGG CGTTCTATGA GCGTGCTGGT ACAGCTGAGT GTATTGGAAC 4 50 

ACCACTTCGT GAAGGTTCAG TTACCATTGT TGGTGCTGTA TCTCCACCA 4 99 



2) INFORMATION FOR SEQ ID NO: 835 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 64 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 5 

TTTCTCAAGC TTTGAGTAAA TATAGTAACT CTGATGTTAT TATTTACATT 50 

GGTTGTGGAG AAAGAGGAAA TGAAATGGCA GAAGTTCTTA CAGAATTCCC 10 0 

TGAGCTTTAT ACTATGGTTG ATGGAAAGAA GGAGTCAATT ATGCAAAGAA 150 

CTTGTTTAGT AGCTAATACA T CAAATATGC CTGTCGCTGC TAGAGAAGCT 2 00 

9<5 TCCATCTACA CTGGTATTAC ACTTTCTGAA TACTTTAGAG ATATGGGATG 250 

TAATGTTTCT ATGATGGCAG ATTCAACTTC TCGTTGGGCT GAAGCTCTTA 300 

GAGAAATTTC TGGTAGATTA GCTGAAATGC CTGCAGATTC GGGTTACC CA 3 50 

GCATATTTAG GCGCCAGACT TGCTTCATTC TATGAAAGAT CAGGAAGAGT 400 

TAAATGTATG GGTTCCCCAG ATAGAGAAGG TACAGTAACA ATTGTTGGTG 450 

3 0 CAGTTTCTCC ACCT 464 



10 



15 



20 



2) INFORMATION FOR SEQ ID NO: 836 

35 3) 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 446 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
4 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Leishmania donovani subsp. infantum 

(B) STRAIN: MOU 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 836 

50 GCCCTCTCCA AGTACTCCAA CTCCGATTGC GTCATCTATG TCGGCTGCGG 50 

CGAGCGCGGT AATGAGATGG CCGAGGTGCT CATGGAGTTC CCGACCCTGA 100 

CGACCGTGAT CGATGGCCGC GAGGAGTCGA TCATGAAGCG CACCTGCCTC 15 0 

GTGGCGAACA CCTCGAACAT GCCAGTCGCA GCCCGTGAGG CCTCTATTTA 2 00 

CACCGGCATC ACCCTGGCCG AGTACTACCG TGATATGGGC AAGCATATCG 2 50 

55 CCATGATGGC TGACTCGACG TCTCGCTGGG CCGAGGCGCT TCGTGAGATT 3 00 

TCGGGTCGTC TGGCGGAGAT GCCGGCGGAT GGTGGCTACC CCGCCTACCT 350 

CAGCGCTCGT CTCGCCTCCT TCTACGAGCG CGCCGGCCTC GTCACCTGCA 40 0 

TCGGCGGGCC GAAGCGC CAG GGCTCCGTCA CGATCGTCGG TGCCGT 44 6 
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2 ) INFORMATION FOR SEQ ID NO: 837 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 4 56 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania major 

(B) STRAIN: ATCC 50122 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 837 



TTAGTCAGGC CCTCTCCAAG TACTCCAACT CCGACTGCGT CATCTATGTC 50 

GGCTGCGGCG AGCGCGGTAA TGAGATGGCT GAGGTGCTCA TGGATTTCCC 100 

2 0 AACTCTGACG ACCGTGATCG ATGGTCGCGA GGAGTCCATC ATGAAGCGCA 150 

CCTGCCTCGT GGCAAACACT TCGAACATGC CAGTCGCAGC CCGCGAGGCC 2 00 

TCTATTTACA CCGGCATCAC CCTGGCCGAG TACTACCGTG ATATGGGCAA 2 50 

GCATATTGCC ATGATGGCCG ACTCGACATC TCGCTGGGCC GAGGCGCTTC 300 

GTGAGATTTC CGGTCGTCTG GCGGAGATGC CAGC CGATGG TGGCTACCCT 350 

25 GCCTACCTCA GCGCTCGTCT CGCCTCCTTC TACGAGCGCG CCGGCCTCGT 4 00 

CACCTGCATC GGCGGGCCGA AGCGCCAGGG CTCCGTCACG ATCGTCGGTG 4 50 

CTGTGT 4 56 



30 



40 



2) INFORMATION FOR SEQ ID NO: 83 8 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 450 bases 
35 (B) TYPE: Nucleic acid 

<C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tarentolae 

(B) STRAIN: MOU-2 

4 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 8 

AGGCCCTCTC CAAGTACTCC AACTCCGACT GCGTCATCTA CGTCGGCTGC 50 

GGCGAGCGCG GTAATGAGAT GGC CGAGGTG CTCATGGAGT TCCCGACCCT 100 

GACGACTGTG ATTGATGGCC GTGAGGAGTC GATCATGAAG CGGACCTGCC 150 

50 TCGTGGCCAA CACCTCCAAC ATGCCAGTCG CAGCCCGTGA AGCCTCTATT 2 00 

TACACTGGTA TCACCCTGGC CGAATACTAC CGTGATATGG GCAAGCATAT 250 

CGCCATGATG GCTGACTCGA CGTCTCGCTG GGCGGAGGCG CTTCGTGAGA 3 00 

TTTCGGGTCG CCTGGCGGAG ATGCCGGCTG ATGGTGGGTA CCCCGCCTAC 350 

CTTAGTGCTC GTCTTGCCTC CTTCTACGAG CGTGCCGGCC TCGTCACCTG 400 

55 CATCGGTGGG CCGAAGCGCC AGGGCTCCGT CACGATCGTC GGTGCCGTGT 450 



2 ) INFORMATION FOR SEQ ID NO: 839 

60 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 437 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Trypanosoma brucei 

(B) STRAIN: EATRO 795 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 9 

15 GCACTCTCGA AGTACTCGAA CAGTGACGCT GTTATTTACG TGGGTTGTGG 5 0 

X " TGAGCGTGGC AATGAGATGG CGGAGGTGCT CATGGACTTT CCCACCCTCA 10 0 

CCACCATTAT TGATGGACGT GAGGAGTCTA TCATGAAGCG CACATGCCTG 15 0 

GTGGCAAATA CTTCCAATAT GCCTGTTGCT GCTCGTGAGG CATCTATTTA 200 

CACTGGTATC ACCTTAGCTG AGTATTATCG GGATATGGGA AAACACATCG 250 

2 0 C C ATGATGGC CGATTCAACC TCCCGCTGGG CTGAGGCTTT GCGTGAGATT 3 00 

TCCGGTCGTC TTGCTGAAAT GCCTGCAGAT GGAGGTTATC CCGCGTACCT 350 

CAGCGCCCGT TTGGCCTCCT TCTACGAGCG TGCTGGCCGT GTGACATGCA 40 0 

TCGGTGGGCC GAAGCGTGAA GGGTCTGTAA CAATTGT 43 7 



25 

2) INFORMATION FOR SEQ ID NO: 840 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 1052 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

3 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma, cruzi 

(B) STRAIN: MM3 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 840 



TGAAGGCTGA GCGCGAGCGC GGCATCACGA TCGACATCGC GCTCTGGAAG 50 

TTCGAGTCGC CCAAGTCTGT GTTCACGATC ATCGACGCCC CCGGCCACCG 100 

4 5 CGACTTCATC AAGAACATGA TCACGGGCAC GTCTCAGGCG GACGCCGCCG 15 0 

TCCTTGTCAT TGCGTCATCG CAGGGTGAGT TTGAGGCGGG CATCTCGAAG 2 00 

GACGGCCAGA CACGCGAGCA CGCGCTGCTC GCCTTCACGC TCGGCGTGAA 2 50 

GCAGATGGTT GTGTGCTGCA ACAAGATGGA CGACAAGTCG GTGAACTTCG 3 00 

CCCAGGAGCG CTACGATGAG ATTGTGAAGG AGGTGTCGGC GTACCTGAAG 350 

50 AAGGTTGGGT ACAACGTGGA GAAGGTGCGC TTCATCCCCA TCTCCGGCTG 4 00 

GCAGGGCGAC AACATGATTG ACAAGT CGGA AAATATGCCG TGGTACAAGG 450 

GCCCCACGCT GCTGGAGGCA CTCGACATGC TGGAGCCCCC GGTGCGCCCC 500 

AG CGACAAGC CGCTGCGCCT GCCGCTGCAG GACGTGTACA AGATCGGCGG 5 50 

TAT CGGCAC C GTGCCGGTCG GTCGCGTGGA GACGGGCACG ATGAAGCCCG 600 

55 GCGACGTGGT GACGTTTGCG CCCGCCAACG TGACGACGGA GGTGAAGTCG 650 

ATTGAGATGC ACCACGAGCA GCTGGCCGAG GCCACGCCCG GCGACAACGT 700 

CGGCTTCAAC GTGAAGAACG TGTCCGTGAA GGACATC CGC CGTGGCAACG 7 50 

TGTGCGGCAA CTCGAAGAAC GACCCCCCAA AGGAGGCGGC CGACTTCACG 800 

GCGCAGGTGA TCATCCTGAA CCACCCCGGC CAGATCGGCA ACGGCTATGC 850 

60 GCCGGTGCTC GACTGCCACA CCTGCCACAT CGCGTGCAAG TTCGCCGAGA 900 
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TCGAGTCCAA GATCGACCGC 
AAGTCGATCA AGTCCGGTGA 
GCCCATGTGC GTGGAGGTGT 
CC 



CGCTCCGGCA AGGAGCTTGA 
CGCCGCCATG GTGCGCATGG 
TCAACGACTA CGCTCCTCTT 



GAAGAACCCC 950 
TGCCGCAGAA 1000 
GGCCGCTTTG 1050 
1052 



2) INFORMATION FOR SEQ ID NO: 841 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1061 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 
20 (B) STRAIN: CGL-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 841 

TGAAGGCTGA GCGCGAGCGC GGCATCACGA TCGACATCGC GCTCTGGAAG 50 

2 5 TTCGAGTCGC CCAAGTCTGT GTTCACGATC ATCGACGCCC CCGGCCACCG 100 

CGACTTCATC AAGAACATGA TCACGGGCAC GTCTCAGGCG GATGCCGCCG 15 0 

TCCTTGTCAT TGCGTCATCG CAGGGTGAGT TTGAGGCGGG CATCTCGAAG 2 00 

GACGGC CAGA CACGCGAGCA CGCGCTGCTC GCCTTCACGC TCGGCGTGAA 250 

GCAGATGGTT GTGTGCTGCA ACAAGATGGA CGACAAGTCG GTGAACTTTG 30 0 

3 0 CC CAGGAGC G CT AC GATGAG ATTGTGAAGG AGGTGTCGGC GTAC CTGAAG 3 50 

AAGGTTGGGT ACAACGTGGA GAAGGTGCGC TTCATCCCCA TCTCCGGCTG 4 00 

GCAGGGCGAC AACATGATTG ACAAGTCGGA AAATATGC CG TGGTACAAGG 45 0 

GCCCCACGCT GCTGGAGGCA CTCGACATGC TGGAGCCCCC GGTGCGCCCC 50 0 

AGCGACAAGC CGCTGCGCCT GCCGCTGCAG GATGTGTACA AGATCGGCGG 550 

3 5 TATCGGCACC GTGCCGGTCG GTCGCGTGGA GACGGGCACG ATGAAGCCCG 60 0 

GCGACGTGGT GACGTTTGCG CCCGCCAACG TGACGACGGA GGTGAAGTCG 65 0 

ATTGAGATGC ACCACGAGCA GCTGGCCGAG GCCACGCCCG GCGACAACGT 70 0 

CGGCTTCAAC GTGAAGAACG TGTCCGTGAA GGACATCCGC CGTGGCAACG 75 0 

TGTGCGGCAA CTCGAAGAAC GATCCCCCAA AGGAGGCGGC CGACTTCACG 8 00 

4 0 GCGCAGGTGA TCATCCTGAA CCACCCCGGC CAGATCGGCA ACGGCTATGC 850 

GCCGGTGCTT GACTGCCACA CCTGCCACAT CGCGTGCAAG TTCGCCGAGA 90 0 

TCGAGTCCAA GATCGACCGC CGCTCCGGCA AGGAGCTTGA GAAGAACCCC 95 0 

AAGTCGATCA AGTCCGGTGA CGCCGCCATG GTGCGCATGG TGCCGCAGAA 1000 

GCCCATGTGC GTGGAGGTGT TCAACGACTA CGCTCCTCTT GGCCGCTTTG 105 0 

4 5 CCGTGCGTGA C 10 61 



2) INFORMATION FOR SEQ ID NO: 842 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
6 0 (A) ORGANISM: Trypanosoma, cruzi 
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(B) STRAIN: PCU-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 842 

GCTGAAGGCT GAGCGCGAGC GCGGCATCAC GATCGACATC GCGCTCTGGA 
AGTTCGAGTC GCCCAAGTCT GTGTTCACGA TCATCGACGC CCCCGGCCAC 
CGCGACTTCA T C AAGAAC AT GATCACGGGC ACGTCTCAGG CGGACGCCGC 
CGTCCTTGTC AT TG C GT CAT CGCAGGGTGA GTTTGAGGCG GGCATCTCGA 
AGGACGGCCA GACACGCGAG CACGCGCTGC TCGCCTTCAC GCTCGGCGTG 
AAG C AGATGG TTGTGTGCTG CAACAAGATG GACGACAAGT CGGTGAACTT 
CGCCCAGGAG CGCTACGATG AGATTGTGAA GGAGGTGTCG GCGTACCTGA 
AGAAGGTTGG GTACAACGTG GAGAAGGTGC GCTTCATCCC CATCTCCGGC 
TGGCAGGGCG ACAACATGAT TGACAAGTCG GAAAATATGC CGTGGTACAA 
GGGCCCCACG CTGC TGGAGG CACTCGACAT GCTGGAGCCC CCGGTGCGCC 
CCAGCGACAA GCCGCTGCGC CTGCCGCTGC AGGACGTGTA CAAGATCGGC 
GGTATCGGCA CCGTGCCGGT CGGTCGCGTG GAGACGGGCA CGATGAAGCC 
CGGCGACGTG GTGACGTTTG CGCCCGCCAA CGTGACGACG GAGGTGAAGT 
CGATTGAGAT GC AC CACGAG CAGCTGGCCG AGGCCACGCC CGGCGACAAC 
GTCGGCTTCA ACGTGAAGAA CGTGTCCGTG AAGGACATCC GCCGTGGCAA 
CGTGTGCGGC AACTCGAAGA ACGACCCCCC AAAGGAGGCG GCCGACTTCA 
CGGCGCAGGT GATCATCCTG AACCACCCCG GCCAGATCGG CAACGGCTAT 
GCGCCGGTGC TCGACTGCCA CACCTGCCAC ATCGCGTGCA AGTTCGCCGA 
GATCGAGTCC AAGATCGACC GCCGCTCCGG CAAGGAGCTT GAGAAGAACC 
CCAAGTCGAT CAAGTCCGGT GACGCCGCCA TGGTGCGCAT GGTGCCGCAG 
AAGCCCATGT GCGTGGAGGT GTTCAACGAC TACGCTCCTC TTGGCCGCTT 
TGCCGTGCGT GA 



, 2 ) INFORMATION FOR SEQ ID NO 843 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1057 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bovis 

(B) STRAIN: Suarez-3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 843 

TACATTGACA GCTGCATTGA CCAAGGTCTG TTCGATGGGT GGTCATGGCG 
AGTACACTCC TTATGAAGCA ATTGACCGTG CTCCTGAGGA GCGTAAACGT 
GGTATTACTA TAAATTCGAC ACATGTTGAA TATGAGACTA AGAACCGTCA 
TTACGGTCAC GTGGACTGTC CAGGTCACTC TGATTATGTG AAGAACATGA 
TATCTGGCGC TGCTCAGATG GATGGTGCCA TATTGGTTGT TTCTTGTGTT 
GACGGTCCCA TGCCTCAGAC TAAGGAGCAC GTGTTGCTTG CTAAGCAGAT 
TGGTGTACCT CGTTTAGTTG TGTTTTTGAA CAAGCTTGAC ATGTTAGAGG 
ACTCTGAGCT ATTGGAGTTG GTGGAGTTAG AGGTT CGTGA GTTATTGAGT 
GAGTTTGGTT ACGACGGTGA CAACACGCCT ATCGTTCGTG GCAGTGCTAT 
AAAGGCATTG AACAGTAGTT CCGAGGCTGA C ATTAAGC C A ATTCAGGATT 
TATTGGATGC GTGTGATGCC TTTTTACTGA CTCCAGAACG TAAGGATGAC 
ATGCCGCTCT TGGTTGCTAT TGACGATGTT CTTGC CATTC CTGGCAAGGG 
TACTGTTGTA ACCGGTAGGA TAGAGCAGGG CAAGATTCGT TGTGGTGACC 
CTATTGAGGT TTGCGCCGGT CCGAAGTCCG GCAAGAAGAC TGTGTGTGTT 
GGTCTTRARA TGTTCCGCAA GAGTCTCAGT RAGGGTATTG CTGGTGACCA 
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GATTGGTGTT TTGCTCAAGG GTGTGAAGCG 
TTGTATTGAT TCAACCCGGA AGTTACAAAT 
GACTTGTACG TGTTGACTAC GRAGGAAGGT 
GTCTAACTAC CGTCCTCAGG CGTTTATACG 
5 CAGTTCATTT GGATRAGGGT GTTGAGATGG 
CGTTGCAAGA TCAAGTTACT TTATCCCATG 
ATTTGCG 



CGACRAGGTA GAGCGCGGGT 800 

GTCACGGTGA ATTTGATGCT 850 

GGGCGCAAGC AT CCGTTTGT 90 0 

TACTGGAGAC GTTTGCTGCT 95 0 

CAGCTCCTGG TGACAACGTG 100 0 

CCTGTCCATG AAGGTTTACG 1050 

1057 



10 



15 



20 



25 



2) INFORMATION FOR SEQ ID NO: 844 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 943 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania aethiopica 

(B) STRAIN: ATCC 50119 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 844 



CGGTGGCATC 
GCGAGCACCT 
TTCATCAACA 
3 0 CATGGAGGTG 
CACCAATCGT 
TACGAGGAGA 
CGACCCGCCG 
TTTACGAGAT 

3 5 GATCAGGGCG 

CGCCAAGAAG 
CGCTGAGCGA 
ACCGGCGACA 
AATGGCGGCG 

4 0 TGTACGTGCT 

CACTACCGCC 
GAGCTTCCCG 
GCCGCGGCCC 
GAGAG CAAGC 
45 GCTCGCGTAC 



ATTGTGGTGG 
CCTGATCTGC 
AAGTGGATAT 
CGCGAGCAGC 
GCGCGGCTCG 
ACATCCTCGA 
CGCAACACAG 
CGG CAAGGAC 
TGCTGAAGCT 
TCGACGGTGA 
GTGCATGCCC 
CAACCAGTCT 
ACGGGTAGCA 
GACGAAGGAT 
CGCAGCTCTT 
GAGGCGGAGA 
CGAGGAGGAC 
TCGTCTGCAT 
CCGATGCCCA 



CGGCCACCGA 
TCGCAGATTG 
GACGGACGAG 
TGGAGAAATA 
GCCCTCAAGG 
ACTGGTGCGG 
ACAAGCCTTT 
AAGAAGAGCG 
CAACACAGAC 
GGGTGACGGG 
GGTGACTCCG 
GTCCAAGGAC 
CGAACCTGTA 
GAGGGCGGCC 
CTTCCATTGC 
AGCACCGGGA 
AAGAAGAAAG 
GCCGGGCGAT 
TTGAAAAGGG 



CGGCGTCATG 
GGCTTCCGGC 
GACACGTGCG 
CAAGTTTCCG 
CCGTCGAGGG 
AAGTGCGACG 
CCTTATGGCC 
TCATCGTGAC 
GCCGAGCTGG 
CATCGAGATG 
TCGGCGTCAG 
AACGTGGAAC 
CAACAAGGTG 
GCCACACCGG 
GCTGACGTGA 
GGAGCTGAAC 
AGGCGGAGAT 
AACCGCGAGC 
CCTGAAGTTC 



CCGCAGACAC 50 

GCTCGTAGGG 100 

ACCTGGTGGA 15 0 

GCGGAGGAGA 2 00 

CGACGCGAAG 250 

AGTGGATCCC 3 00 

ATCGAGCACG 350 

CGGCCGCGTC 4 00 

CCGGCTTCAG 4 50 

TACCACAAGA 500 

CATTGTCGGC 550 

GCGGCATGGT 60 0 

AAGGCGCAGG 650 

CTTCAGCCCC 7 00 

CAGCGGACAT 750 

AAGAAATTCG 800 

GAAGGAGTTC 850 

TGATCCTGAC 900 

ACC 943 



2) INFORMATION FOR SEQ ID NO: 845 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Leishmania amazonensis 
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(B) STRAIN: ATCC 50131 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84 5 

c pggtGGCATC ATTGTGGTGG CGGCCACCGA CGGCGTCATG CCGCAGACAC 50 

GCGAGCATCT CTTGATCTGC TCGCAGATTG GGCTTCCGGC GCTCGTAGGG 100 

TTCATCAACA AAGTGGACAT GACGGACGAG GACACGTGCG ACCTGGTGGA 150 

CATGGAGGTG CGTGAGCAGC TGGAGAAATA CAAGTTTCCG GCGGAAGAGA 2 00 

CGCCCATCGT GCGCGGCTCG GCCCTCAAAG CCGTCGAGGG CGACGCGAAG 2 50 

10 TACGAGGAGA ACATCCTCGA ACTGGTGCGG AAGTGCGACG AATGGATCCC 300 

CGACCCGCCG CGCAACACAG ACAAGCCTTT CCTTATGGCC ATTGAGCACG 3 50 

TTTACGAGAT CGGCAAGGAC AAGAAGAGCG TCATCGTGAC CGGCCGCGTC 4 00 

GATCAGGG CG TGCTGAAGCT CAACACAGAC GCCGAGCTGG CCGGCTTCAG 450 

CGCCAAGAAG TCGACGGTGA GGGTGACGGG CATCGAGATG TACCACAAGA 500 

15 CGCTGAGTGA GTGCATGCCC GGTGACTCCG TCGGCGTCAG CATTGTCGGC 55 0 

ACCGGCGACA CGATCAGTCT CTCCAAGGAC AACGTTGAAC GCGGCATGGT 600 

AATGGCGGCA ACGGGTAGCA CGAACCTGTA CAACAAGGTG AAGGCGCAGG 650 

TGTACGTGCT GACGAAGGAT GAGGGCGGCC GCCACACTGG CTTCAGCCCC 700 

CACTACCGCC CGCAGCTCTT CTTCCATTGT GCTGACGTGA CGGCGGACAT 750 

2 0 GAGCTTCCCG GAGGCGGAGA AGCACCGCGA GGAGCT CAAC AAGAAATTCG 800 

GCCGCGGCCC CGAGGAGGAC AAGCAGAAGG AGGCGGAGAT GAAAGAGTTC 850 

GAGAGCAAGC TCGTCTGCAT GCCGGGCGAC AACCGCGAGC TGATCCTGAC 90 0 

GCTGGCGTAC CCGATGCCCA TTGACAAGGG TCTGAAGTT 93 9 



25 



2) INFORMATION FOR SEQ ID NO: 846 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 94 5 bases 

.(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania donovani subsp . donovani 

(B) STRAIN: ATCC 50212 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 846 

CATTGTGGTG GCGGCCACCG ACGGCGTCAT GCCGCAGACA CGCGAGCACC 50 

TCCTGATCTG CTCGCAGATC GGGCTTCCGG CGCTCGTAGG GTTCATCAAC 100 

45 AAGGTGGACA TGACGGACGA GGACACGTGC GACCTGGTGG ACATGGAGCT 150 

GCGCGAGCAG CTGGAGAAAT ACAAGTTTCC GGCGGAGGAG ACGCCAATCG 200 

TGCGCGGCTC AGCCCTCAAA GCCGTCGAGG GCGATGCGAA GTACGAGGAG 250 

AACATTCTCG AACTGGTGCG GAAGTGTGAC GAGTGGATCC CTGACCCGCC 3 00 

GCGCAACACA GACAAGCCTT TCCTTATGGC CATCGAGCAC GTTTACGAGA 350 

50 TCGGCAAGGA CAAGAAGAGC GTTGTCGTGA CCGGCCGCGT CGATCAGGGC 4 00 

ATTCTGAAGC TCAACACAGA CGCCGAGCTG GCCGGCTTCA GCTCCAAGAA 4 50 

GTCGACGGTG AGGGTGACGG GCAT CGAGAT GTACCACAAG ACGCTGAGCG 5 00 

AGTGCATGCC TGGTGACTCC GTCGGCGTCA GCATTGTCGG GACCGGCGAC 550 

ACGACCAGTC TATC CAAGGG CAACGTGGAA CGCGGCATGG TGATGGCGGC 6 00 

55 GACGGGTAGC ACGAAC CTGT ACAACAAGGT GAAGGCGCAG GTGTACGTGC 650 

TGACGAAGGA TGAGGGCGGC CGCCACACTG GCTTTAGTCC TCACTACCGC 7 00 

CCGCAGCTCT TCTTCCATTG TGCTGACGTG ACGGCGGACA TGAGCTTCCC 750 

GGAGGC GGAG AAGCACCGCG AAGAGCT CAA CAAGAAATTC GGCCGCGGCC 8 00 

CCGAGGAGGA CAAGAAGAAA GAGGCAGCGA TGAAGGAGTT CGAGAGCAAG 8 50 

60 CTCGTCTGCA TGCCGGGCGA TAACCGCGAG CTGATCCTGA CGCTGGCGTA 90 0 
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GACGAGGATA CGTGCGACCT CGTGGACATG GAGGTGCGGG AACAGCTGGA 150 

GAAGTACAAG TTTCCGGCCG AGGAGACGCC CATCGTACGT GGCTCGGCCC 200 

TCAAGGCCCT CGAGGGGGAT GCGCAATACG AGGGGAGTAT TCTCGAGCTG 250 

GTGCGAAAGT GCGACGAGTG GATCCCCGAC CCGCCGCGCA ACACCGAAAA 3 00 

5 GCCTTTCCTC ATGGCTATCG AGCACGTTTA CGAGCTCGGC AAAGACAAGA 350 

AGAGCGTCAT CGTTACCGGC CGCGTCGATC AAGGTGTGCT GAAGCTCAAC 4 00 

ACAGACGCCG AGCTGGCCGG CTTCAGCGCC AAGAAGGCGA CAGTCAAAGT 450 

GACGGGCATC GAGATGTAT C ACAAGACACT CAATGAGTGC ATGCCCGGCG 5 00 

ACTCTGTCGG TGTCAGCATC GTCGGTACCG GTGACACGAC CAGCTTATCC 550 

10 AAGGATAATG TTGAGCGCGG TATGGTAATG GCGGCAACGG GTAGCACGAA 600 

CCTGTACAAC AAGCTGAAGG CGCAGGTTTA CGTGCTGACA AAGGAGGAGG 650 

GTGGCCGCCA CACCGGGTTC AGCCCCCACT ACCGCCCGCA GCTCTTCTTC 70 0 

CACTGCGCTG ACGTGACCGC AGACATGAGC TTCCCGGAGG CGGAGAAGTA 750 

CCGCGAGGAG CTCAACAAGA AGTTCGGCCG TGGCCCTGAG GAGGACAAGA 8 00 

15 AGAAAGAGGC GGAGATGAAG GAGTTCGAAA GCAAACTTGT CTGCATGCCA 8 50 

GGCGATAACC GCGAGCTGAT CCTAACTCTG GCGTACCCGA TGCCCATCGA 900 

CAAGGGCCTG AAGTTCACCA TCCGTGAGGG CGG 93 3 
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2) INFORMATION FOR SEQ ID NO: 849 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 943 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania gerhilli 

(B) STRAIN: ATCC 50121 

3 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 84 9 

CGGTGGCATC ATTGTGGTGG CGGCCACCGA CGGCGTCATG CCGCAGACAC 5 0 

GCGAGCACCT CCTGATCTGC TCGCAGATTG GGCTTCCGGC GCTCGTAGGG 100 

TTCATCAACA AAGTGGACAT GACGGACGAG GACACGTGCG ACCTGGTGGA 150 

40 CATGGAGGTG CGCGAGCAGC TGGAGAAATA CAAGTTTCCG GCGGAGGAGA 2 00 

CACCAATCGT GCGCGGCTCG GCCCTCAAGG CCGTCGAGGG CGACGCGAAG 250 

TACGAGGAGA ACATCCTCGA ACTGGTGCGG AAGTGCGACG AGTGGATCCC 3 00 

CGACCCGCCG CGCAACACAG ACAAGCCTTT CCTTATGGCC ATCGAGCACG 3 50 

TTTACGAGAT CGGCAAGGAC AAGAAGAGCG TCATCGTGAC CGGCCGCGTC 400 

4 5 GAT CAGGGCG TGCTGAAGCT CAACACGGAC GCCGAGCTGG CCGGCTTCAG 4 50 

CGCCAAGAAG TCGACGGTGA GGGTGACGGG CATTGAGATG TACCACAAGA 50 0 

CGCTGAGCGA GTGCATGCCC GGTGACTCCG TCGGCGTCAG CATTGTCGGC 550 

AC CGG CGACA CGACCAGTCT GTCCAAGGAC AACGTGGAAC GCGGCATGGT 600 

AATGGCGGCG ACGGGTAGCA CGAACCTGTA CAACAAGGTG AAGGCGCAGG 6 50 

50 TGTACGTGCT GACGAAGGAT GAGGGCGGCC GCCACACTGG CTTCAGCCCC 700 

CACTACCGCC CGCAGCTCTT CTTCCATTGC GCTGACGTGA CAGCGGACAT 750 

GAGCTTCCCG GAGGCGGAGA AGCACCGCGA GGAGCTCAAC AAGAAATTCG 8 00 

GCCGCGGCCC CGAGGAGGAC AAGAAGAAAG AGGCGGAGAT GAAGGAGTTC 8 50 

GAGAGCAAGC TCGTCTGCAT GCCGGGCGAT AACCGCGAGC TGATCCTGAC 900 

55 GCTGGCGTAC CCGATGCCCA TTGAAAAGGG TCTGAAGTTC ACC 94 3 



2) INFORMATION FOR SEQ ID NO: 850 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Leishmania major 

(B) STRAIN: ATCC 50122 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 850 

15 GGCATCATTG TGGTGGCGGC CACCGACGGC GTCATGCCGC AGACACGCGA 50 

ACACCTCCTG ATCTGCTCGC AAATTGGCCT TCCGGCGCTC GTAGGGTTCA 100 

TCAACAAAGT GGACATGACG GACGAGGACA CGTGTGACCT GGTGGACATG 150 

GAGGTGCGCG AGCAGCTGGA GAAATACAAG TTTCCGGCGG AGGAGACACC 2 00 

AATCGTGCGC GGCTCGGCCC TCAAGGCCGT CGAGGGCGAC GCGAAGTACG 2 50 

20 AGGAGAACAT CCTCGAACTG GTGCGGAAGT GCGACGAGTG GATCCCCGAC 3 00 

CCGCCGCGCA ACACAGACAA GCCTTTCCTT ATGGCCATCG AGCACGTTTA 350 

CGAGATCGGC AAGGACAAGA AGAGCGTCAT CGTGACCGGC CGCGTCGATC 400 

AGGGCGTGCT GAAGCTCAAC AC AGACG C CG AGCTGGCCGG CTTCAGCGCC 4 50 

AAGAAGT CGA CGGTGAGGGT GACGGGCATT GAAATGTACC ACAAGACGCT 500 

2 5 GAGCGAGTGC ATGCCCGGTG ACTCCGTCGG CGTCAGCATT GTCGGCACCG 55 0 

GCGACACGAC CAGTCTGTCC AAGGACAACG TGGAGCGCGG CATGGTAATG 60 0 

GCGGCGACGG GTAGCACGAA CCTGTACAAC AAGGTGAAGG CGCAGGTGTA 650 

CGTGCTGACG AAGGATGAGG GCGGCCGCCA CACTGGCTTC AGCCCCCACT 700 

ACCGCCCGCA GCTCTTCTTC CATTGCGCTG ACGTGACAGC GGACATGAGC 750 

30 TTCCCGGAGG CGGAGAAGCA CCGCGAGGAG CTCAACAAGA AATTCGGCCG 800 

CGGCCCCGAG GAGGACAAGA AGAAAGAGGC GGAGATGAAG GAGTTCGAGA 85 0 

GCAAGCTCGT CTGCATGCCG GGCGATAACC GCGAGCTGAT CCTGACGCTG 90 0 

GCGTACCCGA TGCCCATT 918 
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2) INFORMATION FOR SEQ ID NO: 851 



(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 93 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

4 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lelshmania mexicana 

(B) STRAIN: ATCC 50156 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 851 



CGGTGGCATC ATTGTGGTGG CGGCCACCGA CGGCGTCATG CCGCAGACAC 50 

GCGAGCATCT CCTGATCTGC TCGCAGATTG GGCTTCCGGC GCTCGTAGGG 100 

55 TTCATCAACA AAGTGGACAT GACGGACGAG GACACGTGCG ACCTGGTGGA 150 

CATGGAGGTG CGTGAGCAGC TGGAGAAATA CAAGTTTCCG GCGGAAGAGA 2 00 

CGCCCATCGT GCGCGGCTCG GCCCTCAAGG CCGTCGAGGG CGACGCGAAG 2 50 

TACGAGGAGA ACATCCTCGA ACTGGTGCGG AAGTGCGACG AATGGATCCC 3 00 

CGACCCGCCG CGCAACACAG ACAAGCCTTT CCTTATGGCC ATTGAGCACG 3 50 

60 TTTACGAGAT CGGCAAGGAC AAGAAGAGCG TCATCGTGAC CGGCCGCGTC 400 
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GATCAGGGCG TGCTGAAGCT 
CGT CAAGAAG TCGACGGTGA 
CGCTGAGTGA GTGCATGCCC 
ACCGGCGACA CGATCAGTCT 
5 AATGGCGGCA ACGGGTAGCA 
TGTACGTGCT GACGAAGGAT 
CACTACCGCC CGCAGCTCTT 
GAGCTTCCCG GAGGCGGAGA 
GCCGCGGCCC CGAGGAGGAC 
10 GAGAGCAAGC TCGTCTGCAT 
GCTGGCGTAC CCGATGCCCA 



CAACACAGAC GCCGAGCTGG 
GGGTGACGGG CATCGAGATG 
GGTGACTCCG TCGGCGTCAG 
CTCCAAGGAC AACGTTGAAC 
CGAACCTGTA CAACAAGGTG 
GAGGGCGGCC GCCACACTGG 
CTTCCATTGT GCTGACGTGA 
AGCACCGCGA GGAGCTCAAC 
AAGCAGAAGG AGGCGGAGAT 
GCCGGGCGAC AACCGCGAGC 
TTGAGAAGGG TCTGAAGTT 



CCGGCTTCAG 4 50 

TACCACAAGA 500 

CATTGTCGGC 550 

GCGGCATGGT 6 00 

AAGGCGCAGG 650 

CTTCAGCCCC 700 

CGGCGGACAT 750 

AAGAAATTCG 8 00 

GAAAGAGTTC 850 

TGATCCTGAC 900 
939 



15 2) INFORMATION FOR SEQ ID NO: 852 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 912 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tarentolae 

(B) STRAIN: MOU-2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 852 



30 



TCATTGTGGT GGCCGCCACC GACGGCGTCA TGCCGCAAAC ACGGGAGCAC 50 

CTTTTGATCT GCTCGCAGAT CGGGCTGCCG GCGCTCGTAG GGTTCATCAA 10 0 

CAAAGTGGAC ATGACAGACG AAGACACGTG CGACCTGGTA GACCTGGAGG 150 

TGCGTGAGCA GC TGGAGAAG TACAAGTTTC CGGCAGAGGA AACACCAATC 2 00 

35 GTGCGTGGCT CGGCCCTCAA GGCCGTTGAG GGCGATGCAA AGTACGAGGA 2 50 

GAACATCCTC GAACTGGTGC GGAAGTGCGA CGAGTGGATC CCAGACCCGC 3 00 

CACGCAATAC GGACAAGCCT TTCCTTATGG CCATTGAACA CGTGTACGAG 350 

ATCGGCAAGG ATAGGAAAAG CGTCATCGTA ACCGGCCGCG TCGATCAAGG 4 00 

TGTGCTGAAG CTGAACACAG ACGCCGAGCT GGCCGGCTTC AGCGCCAAGA 4 50 

4 0 AGTCGACGGT GAAAGTGACG GGCATTGAGA TGTACCACAA GACACTGACA 500 

GAGTGCATGC CCGGCGACTC TGTCGGCGTC AGCATTGTGG GCACTGGYGA 550 

CACGACCAGC CTCTCTAAGG ACAATGTTGA GCGTGGCATG GTACTGGCCG 600 

CTACGGGTAG CACGAACCTG TACAACAAAG TAAAGGCGCA GGTGTATGTA 65 0 

CTCACGAAGG ATGAGGGCGG CCGCCACACC GGCTTCAGCC CCCACTACCG 700 

45 TCCGCAGCTC TTCTTCCACT GCGCTGACGT AACGGCGGAC ATGAGCTTCC 750 

CGGAGGCGGA GAAGCACCGC GAGGAACTCA ATAAGAAATT CGGCCGCGGC 8 00 

CCCGAGGAGG ACAAGAAAAA GGAGGCGGAG ATGAAGGAGT TCGAGAGCAA 8 50 

GCTGGTCTGC ATGC CAGGCG ATAACCGCGA GCTGATCCTG ACATTGGCGT 900 

ACCCGATGCC TA 912 

50 



2) INFORMATION FOR SEQ ID NO: 853 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

5 (B) STRAIN: MM3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 853 

ATTCTTGTGG TGGCAGCTAA CGACGGATGC ATGCCGCAGA CGCGTGAGCA 50 

10 CCTGCTTATT TGTTCGCAGA TTGGCCTTCC TGCTCTTGTA TGCTTTATCA 100 

ATAAGTGTGA CATGATGCAA GGGCAGGAGG AAATGATTGA ACTTGTTGAA 150 

ATGGAGGTAC GTGAACTTTT GGAGAAGTAC AAGTTCCCTG CGGAGGAGAC 2 00 

GCCATTTGTG CGGGGGTCTG CGGTGAAGGC ATTGGAGGGT GATGCTGAAA 250 

ATGAAGGAAA GATTTTGGAG CTTGTAAAAA AATGTGATGA ATGGATTCCC 3 00 

15 GACCCACCGC GTGCCATTGA AAAACCGTTC CTTATGGCCA TTGAGCACGT 3 50 

TTTTGAGGTT GGAAAGGATA AGAAGGCCGT TGTTGTGAGC GGGCGTGTGG 400 

ACCAGGGGCA GTTGAAGGTC GGCGCAGATG CAGAACTTTC CGGGTTTAGC 4 50 

GCAAAGAAGC TGACGGTGAA GGTTGCTAGC ATCGAAATGT ACCATAAAAT 500 

TCTGGAGGAT TGCATGCCTG GTGACTCTGT TGGCGCGAAG ATCGTTGGCA 550 

2 0 GCGGTGAAAC AGTGAACCTG TCGAAGGAAA ATGTGGAACG CGGCATGGTA 6 00 

CTCTCCGCAC CAGGTGCAAC GACACTGTTC AACAAGGTCC GCGCGCAGGT 650 

GTACGTGTTG ACAAAGGAAG AAGGCGGTCG TCACACAGCC TTTAGTCCTC 700 

ACTATCGTCC GC AGCTTTTC TTCCACTGTG CTGATGTCAC GGCAGATATT 750 

AACTTCC CGG AAAGCGAGAA GCTTGCAGGG GAGCTGAACA AAAAGTATGG 800 

2 5 CCGTGATGCG GCGGAACAGA AGAAGAAGGA GGCAGAACTG AAAGAGTTTG 850 

AAAAGACGCT TGTCTGCATG CCTGGTGATA ACCGCGAACT CCTGCTCACC 90 0 

CTTGCCTATC CAATGC CAAT GGAAAAGGGA CTCAAG 93 6 
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2) INFORMATION FOR SEQ ID NO: 854 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 934 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: PCU-1 

4 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 854 

CGGCATTCTT GTGGTGGCAG CTAACGACGG ATGCATGCCG CAGACGCGTG 50 

AGCAC CTGCT TATTTGTTCG CAGATTGGCC TTCCTGCTCT TGTATGCTTT 100 

ATCAATAAGT GTGACATGAT GCAAGGGCAG GAGGAAATGA TTGAGCTTGT 150 

50 TGAAATGGAG GTACGTGAAC TTTTGGAGAA GTACAAGTTC CCTGCGGAGG 2 00 

AGACGCCATT TGTGCGGGGG TCTGCGGTGA AGGCATTGGA GGGTGATGCT 250 

GAAAATGAAG GAAAGATTTT GGAGCTTGTA AAAAAATGTG ATGAATGGAT 3 00 

TCCCGACCCA CCGCGTGCCA TTGAAAAACC GTTCCTTATG GCCATTGAGC 3 50 

AC GTTTTTGA GGTTGGAAAG GATAAGAAGG CCGTTGTTGT GAGCGGGCGT 4 00 

55 GTGGACCAGG GGCAGTTGAA GGTCGGCGCA GATGCAGAAC TTTCCGGGTT 4 50 

TAGTGCAAAG AAGCTGACGG TGAAGGTTGC TAGCATCGAA ATGTAC C ATA 5 00 

AAATTCTGGA GGATTGCATG CCTGGTGACT CTATTGGCGC GAAGATCGTT 550 

GGCAGCGGTG AAACAGTGAA CCTGTCGAAG GAAAATGTGG AACGCGGCAT 6 00 

GGTACTCTCC GCACCAGGTG CAACGACACT GTTCAACAGG GTCCGCGCGC 65 0 

60 AGGTGTACGT GTTGACAAAG GAAGAAGGCG GTCGTCACAC AGCCTTTAGT 70 0 
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CCTCACTATC GTCCGCAGCT 
TATTAACTTC CCGGAAAGCG 
ATGGCCGTGA CGCGGCGGAA 
TTTGAAAAGA CGCTTGTCTG 
5 CACCCTTGCC TAT C CAATG C 



TTTCTTCCAC TGTGCTGATG 
AGAAGCTTGC AGGGGAGCTG 
CAGAAGAAGA AGGAGGCAGA 
CATGCCTGGT GATAACCGCG 
CAATGGAAAA GGGA 



TCACGGCGGA 750 
AACAAAAAGT 8 00 

ACTGAAAGAG 8 50 

AACTCCTGCT 900 
934 
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2 > INFORMATION FOR SEQ ID NO: 855 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 937 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
2 0 (A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: CGL-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 855 

2 5 TGGCGGCATT CTTGTGGTGG CAGCTAACGA CGGATGCATG CCGCAGACGC 50 

GTGAGCACCT GCTTATTTGT TCGCAGATTG GCCTTCCTGC TCTTGTATGC 100 

TTTATCAATA AGTGTGACAT GATGCAAGGG CAGGAGGAAA TGATTGAACT 150 

TGTTGAAATG GAGGTACGTG AACTTTTGGA GAAGTACAAG TTCCCTGCGG 2 00 

AGGAGAC GCC ATTTGTGCGG GGGTCTGCGG TGAAGGCATT GGAGGGTGAT 250 

30 GCTGAAAATG AAGGAAAGAT TTTGGAGCTT GTAAAAAAAT GTGATGAATG 300 

GATTC CCGAC CCACCGCGTG CCATTGAAAA ACCGTTCCTT ATGGCCATTG 3 50 

AGCACGTTTT TGAGGTTGGA AAGGATAAGA AGGC CGTTGT TGTGAGCGGG 4 00 

CGTGTGGACC AGGGGCAGTT GAAGGTCGGC GCAGATGCAG AACTTTCCGG 450 

GTTTAGCGCA AAGAAGCTGA CGGTGAAGGT TGCTAGCATC GAAATGTACC 500 

35 ATAAAATTCT GGAGGATTGC ATGCCTGGTG ACTCTGTTGG CGCGAAGATC 550 

GTTGGCAGCG GTGAAACAGT GAACCTGTCG AAGGAAAATG TGGAACGCGG 600 

CATGGTACTC TCCGCACCAG GTGCAACGAC ACTGTTCAAC AAGGTCCGCG 650 

CGCAGGTGTA CGTGTTGACA AAGGAAGAAG GCGGTCGTCA CACAGCCTTT 7 00 

AGTCCTCACT ATCGTCCGCA GCTTTTCTTC CACTGTGCTG ATGTCACGGC 750 

4 0 AGATATTAAC TTCCCGGAAA GCGAGAAGCT TGCAGGGGAG CTGAACAAAA 800 

AGTATGGCCG TGATGCGGCG GAACAGAAGA AGAAGGAGGC AGAACTGAAA 850 

GAGTTTGAAA AGACGCTTGT CTGCATGCCT GGTGATAACC GCGAACTCCT 90 0 

GCTCACCCTT GCCTATCCAA TGCCAATGGA AAAGGGA 937 
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2) INFORMATION FOR SEQ ID NO: 856 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 888 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia Jbi gremina 

(B) STRAIN: Suarez-2 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 856 

CTTATTATGG AGCTGATCAA CAACGTCGCG AAGAAGCACG GTGGTTTCTC 5 0 

CGTGTTCGCC GGCGTCGGCG AGCGCACCAG GGAGGGCAAC GAGCTGTACC 100 

5 ACGAGATGAT GGAGACCGGC GTCATCAAGC GCCGCCAGCT GGATGACGGC 150 

ACGTTCGACT TCTCCGGCTC CAAGGCCGCG CTGGTGTACG GCCAGATGAA 2 00 

CGAGCCGCCA GGTGC CAGGG CGCGTGTTGC CCTCACTGGC CTGACGGTGG 250 

* CCGAGTACTT CCGTGATGAG GACGGCCAGG ACGTGCTGCT CTTCATCGAC 3 00 

AACATCTACC GTTTCACCCA GGCTGGTTCT GAGGTGAGTG CCCTTTTGGG 350 

10 GCGCATCCCG TCCGCCGTCG GTTACCAGCC GACCCTCGCC ACCGACCTTG 4 00 

GCGCGCTGCA GGAGCGTATC ACGACGACCA ACAAGGGCTC CATCACCTCC 4 50 

GTGCAGGCCG TCTACGTGCC GGCCGACGAT ATCACCGACC CGGCGCCTGC 5 00 

GACCACCTTC ACCCATCTGG ACGCGACCAC TGTGCTCTCC CGTTCCATCG 550 

CCGAGCTGGG TATCTACCCC GCCGTCGACC CGCTCGACTC CACCTCGCGT 600 

15 ATGCTGTCCG CGAACATCGT CGGCGAGGAG CAGTACAACG TGGCGCGTGG 650 

CGTGCAGAAA ATACTGCAGG ACTACAAATC GCTGCAGGAT ATCATCGCCA 70 0 

TCCTGGGTAT GGACGAGCTG TCTGAGCAGG ACAAGTTCGT CGTCGCGCGT 75 0 

GCGCGCAAGG TTCAGCGTTT CCTATCCCAG CCCTTCCAGG TGGCTGAGGT 8 00 

ATTCACCGGC AAGCCCGGAC GTTTCGTCGA GCTGCAGGAC ACCAT CAGCG 8 50 

2 0 GCGTCAAGGA GATTTTGGAC GGCGAGTGCG ACGACATG 88 8 



2) INFORMATION FOR SEQ ID NO: 857 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 884 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

3 5 (A) ORGANISM: Babesia bovis 

(B) STRAIN: Suarez-3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 857 

4 0 TGATTATGGA ATTGATCAAC AATGTCGCCA AGAAACACGG TGGGTTCTCC 50 

GTGTTCGCTG GTGTTGGTGA ACGTACGAGG GAAGGTAACG AACTGTACCA 100 

TGAAATGATG GAAACGGGTG TCATCAAGCG CCGTCAACTG GAAGACGGAA 150 

CATTTGACTT CTCGGGCTCT AAAGCTGCTT TGGTGTACGG ACAAATGAAC 2 00 

GAACCAC C AG GTGCTAGAGC CCGTGTTGCA CTCACGGGAT TGACCGTTGC 2 50 

4 5 CGAGTATTTC CGTGATGAAG AGGGGCAGGA TGTGCTACTC TTCATCGATA 3 00 

ACATCTACCG TTTCACCCAG GCCGGTTCCG AAGTGAGTGC GCTGTTAGGA 3 50 

AGAATTCCAT CCGCCGTGGG TTATCAACCT ACATTGGCCA CTGATCTCGG 4 00 

AGCACTCCAG GAACGCATTA CTACAAC CAA CAAGGGTTCG ATTACATCAG 4 50 

TCCAGGCAGT ATACGTC CCA GCCGATGATA TCACTGATCC CGCTCCAGCT 500 

50 ACCACTTTCT CGCACTTGGA TGCCACTACA GTGCTTTCTC GTTCAATTGC 550 

GGAGTTGGGT ATTTACCCTG CGGTCGACCC GCTTGACTCA ACGTCACGTA 600 

TGCTGTCGGC CAACATTGTA GGACAGGAAC AGTACGATGC CGCACGTGGT 650 

GTACAGAAAA TTTTACAGGA CTACAAATCA CTGCAGGATA T CATTGC CAT 700 

TCTGGGTATG GACGAGCTGT CTGAGCAGGA CAAGTTCGTT GTAGCACGCG 750 

55 CCCGTAAGGT ACAGCGTTTC CTGTCTCAGC CGTTCCAAGT GGCTGAGGTG 8 00 

TTCACCGGCA AGCCTGGGAG GTTCGTTGAA CTACAGGATA CCATCAGCGG 850 

TGTCAAGGAA ATC TGGAAGG TGAGTGTGAC GATA 8 84 
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2) INFORMATION FOR SEQ ID NO: 858 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 71 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

10 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia microtti 

(B) STRAIN: Persing-1 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 58 

TGGAACTGAT TAATAATGTG GCCAAAAAGC ATGGCGGTTA CTCTGTTTTT 50 

GCAGGTGTAG GTGAAAGGAC GAGGGAGGGT AATGAATTGT ACCATGAAAT 100 

GATGGAGACA GGTGTTATAA AGAAAAAGGC ACTAGGTGGT GGGAAGTTTG 150 

2 0 ATTTCAGTGG ATCTAAAGCA GCGCTGGTCT ATGGACAAAT GAACGAGCCA 2 00 

CCTGGGGCCC GTGCTAGAGT GGCACTAACT GGATTAACAG TCGCAGAATA 2 50 

TTTCCGTGAC GAACAAGGAC AAGACGTGTT GTTGTTTATT GATAATATTT 3 00 

ACCGATT TAC TCAGGCAGGG TCTGAGGTTT CAGCCTTGCT AGGCCGTATA 3 50 

CCTTCAGCTG TGGGATACCA GCCTACATTG GCAACAGATC TTGGCTGTTT 4 00 

25 ACAAGAACGA ATTACTACGA CCAAATCTGG TTCAATCACC AGTGTACAAG 450 

CTGTGTATGT GCCAGCAGAT GATATTACTG ATCCAGCGCC TGCCACAACT 50 0 

TTTACTCACT TGGACGCTAC TACTGTACTT AGCAGGCCAA TTGCTGAACT 550 

CGGTATT TAT CCAGCGGTAG ACCCGTTGGA TTCAACAAGC CGTATGCTAA 600 

GCGCGAACAT TGTGGGAAAT GAACAC TATA GTGTAGCCCG TTCCGTGCAG 65 0 

3 0 AAGAT AC TGC AAGATTACAA ATCGCTTCAG GACATTATTG CCATTTTGGG 7 00 

TATGGATGAA CTGTCGGAAC AAGACAAAAA T ATAGTAGC C CGAGCAAGGA 75 0 

AGATGCAAAG GTTCTTATCA CAGCCATTCC AAGTGGCGGA AGTTTTTACT 800 

GGTAAACCGG GAAGAT TTGT GGAATTGGAA GATACAATTG CCGGGGCACG 850 

AGATATAATT GCGGGTAATT G 871 

35 



2 ) INFORMATION FOR SEQ ID NO: 859 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 86 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania guyanensis 
50 (B) STRAIN: ATCC 50126 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 859 

TCTCGGAGGG CGTCCCGCCC GTGCTGACGG CGCTGGATGT 
55 CTTGGCCGCG ATGAGCCGCT GACGCTGGAG ATCGTGCAGC 
GAACAC CGGC CGCTGCATTG CGATGCAGAC GACGGACCTG 
AGT CGAAGGT TGTGTCGACC GGCGGCAACA TCTCTGTGCC 
GAGACGCTGG GCCGCATCTT CAACGTGCTG GGCGACGCGA 
CGGCCCCGTG GGTGAGAAGA TGCGCATGGC GATCCACGCC 
60 AGCTGGCGGA TCAGGCCGCG GAGGACACGA TCCTGACGAC 



GACGGAGGAC 50 

ACCTGGACGC 10 0 

CTGAAGCTGA 150 

GGTGGGCCGT 2 00 

TCGACCAGCG 250 

GAGGCCCCGA 3 00 

CGGCATCAAG 3 50 
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GTGATCGACC TGATTCTGCC CTACTGCAAG GGTGGCAAGA TCGGCCTGTT 400 

TGGCGGCGCC GGTGTGGGCA AGACCGTGAT CATCATGGAG CTGATTAACA 450 

ACGTCGCGAA GGGCCACGGT GGTTTCTCGG TGTTTGCCGG CGTTGGCGAG 50 0 

CGCACGCGCG AGGGCACGGA CCTGTACCTG GAGATGATGC AGTCGAAGGT 55 0 

5 GATTGAC CTG AAGGGCGAGT CGAAGTGCGT GCTTGTGTAC GGG CAGATGA 60 0 

ACGAGCCCCC GGGTGCGCGC GCGCGCGTTG CGCAGTCTGC GCTGACGATG 650 

GCGGAGTACT TCCGAGACGT GGAGGGCCAG AATGTGCTGC TGTTCATCGA 700 

CAACATCTTC CGCTTCACGC AGGCGAACTC CGAGGTCTCT GCGCTGCTGG 75 0 

GCCGCATTCC GGCCGCCGTG GGCTACCAGC CGACGCTTGC GGAGGATCTT 8 00 

10 GGTATGCTGC AGGAGCGCAT CACGTCGACG ACGAAGGGGT CGATCACGTC 8 50 

CGTGCAGGCC GTGTACGTGC CTGCGGATGA TAT CACGGAT CCGGCGCCCG 90 0 

CGACGACGTT CTCGCACCTG GACGCGACGA CTGTGCTGGA CCGCGCGGTG 95 0 

GCGGAGTCGG GGATCTACCC TGCCGTGAAC CCGCTGGAGT GCGCGTCGCG 1000 

TATCATGGAC CCCGATGTGA TCGACGTGGA CCACTACAAC GTTGCGCAGG 1050 

15 ATATCGTGCA GATGCTGACC AAGTACAAGG AGCTGCAGGA TATCATTGCG 1100 

GTGCTTGGTA TCGACGAGCT GAGCGAGGAG GACAAGGTCG TGGTGGACCG 1150 

CGCGCGCAAG GTGACCCGGT TCCTGTCGCA GCCGTTCCAG GTTGCGGAGG 12 0 0 

TGTTCACGGG CATGACGGGC CACTACGTGC AGCTGGCCGA CACGGTGGAG 125 0 

TCGTTCTCTG GGCTGCTGAT GGGGTCGTAC GACCAG 12 8 6 

20 



2) INFORMATION FOR SEQ ID NO: 860 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1222 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania mexicana 
35 (B) STRAIN: ATCC 50156 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 860 

CTCGGAGGGC GTCCCGCCCG TGCTGACGGC GCTGGATGTG ACGGAGGACC 5 0 

4 0 TTGGCCGCGA TGAGCCGCTG ACGCTGGAGA TCGTGCAGCA CCTGGACGCG 100 

AACACCGGCC GCTGCATTGC GATGCAGACG ACGGACCTGC TGAAGCTGAA 150 

GTCGAAGGTT GTGTCGACCG GCGGCAACAT CTCTGTGCCG GTGGGCCGTG 2 00 

AGACGCTGGG CCGCATCTTC AACGTGCTGG GCGACGCGAT CGACCAGCGC 2 50 

GGCCCCGTGG GTGAGAAGAT GCGCATGGCG ATCCACGCCG AGGCCCCGAA 3 00 

4 5 GCTGGCGGAT CAGGCCGCGG AGGACACGAT CCTGACGACC GGCATCAAGG 350 

TGATCGACCT GATTCTGCCC TACTGCAAGG GTGGCAAGAT CGGCCTGTTT 4 00 

GGCGGCGCCG GTGTGGGCAA GACCGTGATC ATCATGGAGC TGATTAACAA 4 50 

CGTCGCGAAG GGC CACGGTG GTTTCTCGGT GTTTGCCGGC GTTGGCGAGC 500 

GCACGCGCGA GGGCACGGAC CTGTACCTGG AGATGATGCA GTCGAAGGTG 550 

50 ATTGACCTGA AGGGCGAGTC GAAGTGCGTG CTTGTGTACG GGCAGATGAA 600 

CGAGCCCCCG GGTGCGCGCG CGCGCGTTGC GCAGTCTGCG CTGACGATGG 650 

CGGAGTACTT CCGAGACGTG GAGGGCCAGA ATGTGCTGCT GTTCATCGAC 70 0 

AACATCTTCC GCTTCACGCA GGCGAACTCC GAGGTCTCTG CGCTGCTGGG 750 

CCGCATTCCG GCCGC CGTGG GCTACCAGCC GACGC TTGCG GAGGATCTTG 800 

55 GTATGCTGCA GGAGCGCATC ACGTCGACGA CGAAGGGGTC GATCACGTCC 8 50 

GTGCAGGCCG TGTACGTGCC TGCGGATGAT ATCACGGATC CGGCGCCCGC 900 

GACGACGTTC TCGCACCTGG AC GCGACGAC TGTGCTGGAC CGCGCGGTGG 950 

CGGAGTCGGG GATCTACCCT GCCGTGAACC CGCTGGAGTG CGCGTCGCGT 10 0 0 

ATCATGGACC CCGATGTGAT CGACGTGGAC CACTACAACG TTGCGCAGGA 105 0 

6 0 TATCGTGCAG ATGCTGACCA AGTACAAGGA GCTGCAGGAT ATCATTGCGG 110 0 
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TGCTTGGTAT CGACGAGCTG AGCGAGGAGG ACAAGGTCGT GGTGGACCGC 1150 
ScSgcaagg TGACCCGGTT CCTGTCGCAG CCGTTCCAGG TTGCGGAGGT 1200 
GTTCACGGGC ATGACGGGCC AC 1222 



2) INFORMATION FOR SEQ ID NO: 861 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 124 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 50129 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 861 



CCGTGCTGAC GGCGCTGGAT GTGACGGAGG ACCTTGGCCG CGATGAGCCG 50 

CTGACGCTGG AGATCGTGCA GCACTTGGAC GCGAACACCG GCCGCTGCAT 100 

2 5 TGCGATGCAG ACGACGGACC TGCTGAAGCT GAAGTCGAAG GTTGTGT CGA 150 

CCGGC GGCAA CATCTCTGTG CCGGTGGGCC GTGAGACGCT GGGCCGCATC 2 00 

TTCAACGTTC TGGGCGACGC GATCGACCAG CGCGGCCCCG TGGGCGAGAA 250 

GATGCGCATG GCGATCCACG CCGAGGCCCC GAAGCTGGCG GATCAGGCCG 300 

CGGAGGACAC GATCCTGACG ACCGGCATCA AGGTGATCGA CCTGATTCTG 350 

30 CCCTACTGCA AGGGTGGCAA GATCGGCCTG TTCGGCGGTG CCGGTGTGGG 4 00 

CAAGACTGTG ATCATCATGG AGCTGATCAA CAACGTCGCG AAGGGCCACG 450 

GTGGTTTCTC CGTGTTTGCC GGCGTTGGCG AGCGCACGCG CGAGGGCACG 500 

GACCTGTACC TGGAGATGAT GCAGTCGAAG GTGATTGACC TGAAGGGCGA 550 

GTCGAAGTGC GTGCTTGTGT ACGGGCAGAT GAACGAGCCC CCGGGTGCGC 600 

,c GCGCGCGCGT TGCGCAGTCT GCGCTGACGA TGGC GGAGTA CTTCCGCGAC 650 

GTGGAGGGCC AGAACGTGCT GCTGTTCATC GACAACATCT TCCGCTTCAC 700 

GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT GGGCCGCATT CCGGCCGCCG 750 

TGGGCTACCA GCCGACGCTT GCGGAGGATC TTGGTATGCT GCAGGAGCGC 800 

ATCACGTCGA CAACGAAGGG GTCGATCACG TCCGTGCAGG CCGTGTACGT 850 

4 0 GCCAGCGGAT GATATCACGG ATCCCGCGCC CGCGACGACG TTCTCGCACC 900 

TGGACGCGAC GACTGTGCTG GACCGCGCGG TGGCGGAGTC GGGCATCTAC 950 

CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG CGTATCATGG ACCCCGATGT 1000 

GATCGATGTG GACCACTACA ACGTTGCGCA GGATATCGTG CAGATGCTGA 1050 

CCAAGTACAA GGAGCTGCAG GATATCATTG CGGTGCTTGG CATCGACGAG 1100 

45 CTGAGCGAGG AAGACAAGGT TGTTGTGGAC CGCGCGCGCA AGGTGACCCG 1150 

GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA GGTGTTCACG GGCATGACGG 1200 

GCCACTACGT GCAGCTGGTC GACACGGTGG AGTCGTTCTC TGGCCT 1246 



2) INFORMATION FOR SEQ ID NO: 862 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1265 bases 
55 (b) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

60 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 3 0815 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 862 

GGCGTGCCGC CCGTGCTGAC GGCGCTGGAT GTGACGGAGG ACCTTGGCCG 50 

CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAC GCGAACACGG 100 

GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT GAAGTCGAAG 150 

10 GTCGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC GTGAGACGCT 2 00 

GGGCCGCATC TTCAAYGTTC TGGGCGACGC GATCGACCAG CGCGGCCCCG 250 

TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC GAAGCTGGCG 3 00 

GATCAGGCCG CGGAGGACAC GATCCTGACG ACCGGCATCA AGGTGATCGA 3 50 

CCTGATTCTG CCCTACTGCA AGGGTGGCAA GATCGGCCTG TTCGGCGGTG 400 

15 CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA CAACGTCGCG 4 50 

AAGGGCCACG GCGGTTTCTC CGTGTTTGCC GGCGTTGGCG AGCGCACGCG 500 

CGAGGGCACG GACCTGTACC TGGAGATGAT GCAGTCGAAG GTGATTGACC 550 

TGAAGGGCGA GTCGAAGTGY GTGCTTGTGT ATGGGCAGAT GAACGAGCCC 600 

CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA TGGCGGAGTA 650 

2 0 CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC GACAAC AT C T 700 

TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT GGGCCGCATT 750 

CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCGGAGGATC TTGGTATGCT 800 

GCAGGAGCGC ATCACGTCGA CAACGAAGGG GTCGATCACG TCCGTGCAGG 8 50 

CCGTGTACGT GCCAGCGGAT GATATCACGG ATCCCGCGCC CGCGACGACG 900 

25 TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGCGCGG TGGCGGAGTC 950 

GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG CGTATCATGG 1000 

ACCCTGATGT GAT CGATGTG G AC CACTAC A ACGTTGCGCA GGATATCGTG 1050 

CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG CGGTGCTTGG 1100 

CATCGACGAG CTGAG CGAGG AAGACAAGGT TGTTGTGGAC CGCGCGCGCA 1150 

3 0 AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA GGTGTTCACG 12 0 0 

GGCATGACGG GCCACTACGT GCAGCTGGTC GACACGGTGG AGTCGTTCTC 1250 

TGGCCTGCTG ATGGG - 1265 

35 

2) INFORMATION FOR SEQ ID NO: 863 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1191 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



45 



50 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bordetella pertussis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 863 



ATGGCAAAAG GCAAGTTTGA ACGTACCAAG CCGCACGTGA ACGTGGGTAC 50 

GATTGGTCAC GTTGACCACG GCAAAACGAC GTTGACGGCG GCGATCACGA 100 

CGGTGCTGTC GAACAAGTTC GGCGGCGAGG CTCGCGGCTA CGAC CAGATT 150 

GACGCGGCGC CGGAAGAGAA GGCGCGTGGG ATCACGAT C A ACACCTCGCA 200 

55 CGTTGAGTAC GAGACGGAGA CGCGTCACTA CGCGCACGTT GATTGCCCGG 2 50 

GTCACGCTGA CTACGTGAAG AACATGATCA CGGGTGCTGC GCAGATGGAC 3 00 

GGCGCGATCC TGGTGGTGTC GGCCGCAGAC GGCCCGATGC CGCAGACGCG 350 

CGAGCACATT TTGCTGTCGC GCCAGGTTGG CGTGCCGTAC ATCATCGTGT 400 

TCCTGAACAA GGCGGACATG GTTGATGACG CGGAGCTGCT CGAGCTGGTG 450 

6 0 GAGATGGAAG TCCGCGAACT GCTGAGCAAG TACGATTTCC CGGGCGATGA 50 0 
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CACGCCGATC GTGAATGGTG CGGCCAAGCT 
GCGACCTGGG C GAG CAGGCG ATTC TGTCGC 
TACATTCCGA CGCCGGAGCG CGCGGTCGAC 
GGAAGACGTG TTCTCGATCT CGGGCCGTGG 
5 TCGAGCGCGG CGTGGTGAAG GTTGGCGAGG 
AAGCCGACGG TGAAGACGAC CTGCACGGGC 
GCTGGACCAG GGCCAGGCGG GCGACAACGT 
CCAAGCGTGA AGACGTCGAG CGTGGCCAGG 
ATCAACCCGC ACACGGACTT CACGGCCGAG 
10 AGAGGGTGGC CGTCACACGC CGTTCTTCAA 
ACTTCCGCAC GACGGACGTG ACCGGCACGA 
GAAATGGTGC TGCCGGGCGA CAACGTGTCG 
CCCGATCGCC ATGGAAGAAG GTCTGCGTTT 
GTACCGTCGG TGCCGGCGTC GTCGCCAAGA 

15 



GGCGCTGGAA AGCGACAACG 550 

TGGCGCAAGC GCTGGACACG 600 

GGTGCGTTCC TGATGCCGGT 65 0 

CACGGTGGTG ACTGGCCGTA 700 

AAATC GAAAT CGTGGGCATC 750 

GTGGAGATGT TCCGCAAGCT 8 00 

GGGTATCTTG CTGCGCGGCA 850 

TGCTGGCCAA GCCGGGTTCG 900 

GTGTACATTC TGTCCAAGGA 950 

CGGCTATCGT CCGCAGTTCT 1000 

TCGACCTGCC GGCGGACAAG 1050 

ATGACCGTCA AGCTGCTGGC 1100 

CGCCATCCGT GAAGGCGGTC 1150 

TCATCAAGTA A 1191 



2 ) INFORMATION FOR SEQ ID NO: 8 64 

2 0 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1350 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei 
30 (B) STRAIN: LVH/ 75/ US AMRU - K/ 1 8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 864 

ATGGGAAAGG AAAAGGTGCA CATGAATCTT GTGGTGGTGG GCCACGTCGA 5 0 

3 5 TGCCGGTAAA TCCACTGCAA CGGGTCACTT GATCTACAAG TGCGGTGGTA 100 

TTGACAAACG TACGATCGAG AAGTTCGAGA AAGAAGCTGC CGACATTGGT 150 

AAGGCCTCAT TCAAGTACGC ATGGGTGCTG GACAAGCTGA AGGCTGAGCG 2 00 

CGAACGTGGT ATCACGATCG ACATTGCACT GTGGAAATTC GAGTCACCCA 250 

AGTCTGTCTT CACTATTATT GATGCTCCTG GGCACCGTGA CTTCATCAAG 3 00 

40 AACATGATCA CCGGCACATC GCAAGCCGAC GCAGCCATCC TCATCATTGC 350 

CTCTGCGCAG GGTGAGTTCG AGGCTGGTAT CTCCAAGGAT GGACAGACCC 4 00 

GCGAGCACGC GTTGCTGGCC TTCACTTTGG GTGTGAAGCA GATGGTTGTG 450 

TGCTGCAACA AAATGGACGA CAAGACTGTG AACTACGGAC AGGAGCGGTA 5 00 

TGACGAGATT GTGAAGGAGG TGTCTGCTTA CATCAAGAAG GTTGGGTACA 550 

45 ACGTGGAGAA GGTGCGCTTC GTCCCCATCT CCGGATGGCA GGGC GACAAC 600 

ATGATTGAGA AAT CCGAGAA GATGCCATGG TACAAGGGTC CAACGCTCCT 6 50 

GGAGGCACTA GACATGCTGG AGCCACCAGT GCGTCCGAGC GACAAGCCCC 700 

TGCGTCTGCC ACTGCAGACG TGTACAAAGA TCGGTGGTAT TGGCACCGTG 750 

CCCGTTGGTC GTGTGGAGAC CGGCGTGATG AAGCCTGGTG ATGTGGTGAC 800 

50 GTTTGCCCCC GCCAACGTGA CGACCGAGGT GAAATCGATC GAGATGCACC 850 

ACGAGCAGCT CGCTGAGGCG ACCCCCGGTG ACAACGTCGG CTTTAACGTG 90 0 

AAGAACGTTT CTGTAAAGGA CATCCGCCGT GGCAACGTCT GCGGTAACAC 950 

CAAGAACGAC CCCCCAAAGG AGGCCGCCGA CTTCACGGCA CAGGTGAT CA 1000 

TCCTGAACCA CCCCGGACAG ATTGGAAACG GTTATGCGCC CGTGCTGGAC 1050 

55 TGCCACACAT CGCACATTGC CTGCAAGTTC GCGGAGATCG AGTCGAAGAT 1100 

CGACCGTCGC TCTGGCAAGG AG CTGGAGAA GGCTCCCAAG TCGATCAAGT 115 0 

CTGGCGACGC CGCGATCGTG CGCATGGTGC CGCAGAAGCC TATGTGCGTG 12 00 

GAGGTCTTCA ACGACTACGC GCCACTCGGC CGCTTTGCCG TGCGTGACAT 1250 

GCGCCAGACC GTCGCTGTCG GTATCATCAA GGCCGTGACC AAGAAGGACG 13 0 0 

6 0 GTTCTGGTGG TAAGGTGACG AAGGCTGCGG TGAAGGCTTC GAAGAAATAA 1350 



618 



